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Be here now 


New views of quantum theory that can be tested and have practical applications bring welcome 


echoes of physics past. 


the Cornell University physicist David Mermin, who coined it to 
describe how many physicists use the mathematics of quantum 
theory without thinking too hard about its deeper implications. 

Mermin himselfis clearly not content to shut up and calculate. He has 
mused on the meanings of quantum theory and of physical theory more 
generally for many decades, notably in his Reference Frame column 
for the magazine Physics Today. Now, on page 421 of this issue, he con- 
siders a new view of quantum theory, called quantum Bayesianism or 
QBism, and what it implies for a long-standing question. Namely, how 
can we reconcile our perception that the present moment is special 
with the relativistic view that space-time is a continuum that reaches 
from past to future, with nothing to privilege ‘the Now’? 

Some will see this as a metaphysical question. Certainly, while the 
gulf remains between the formal machinery of quantum mechanics, 
with its wavefunctions and probabilities, and our conscious experience 
of the world, it is hard to see how the question can be framed with the 
rigour that science usually demands. 

Yet QBism offers a way to put us in the picture, even in the absence 
of a theoretical link between the abstract microworld and the subjec- 
tive macroworld. It suggests that quantum theory is telling us what 
an individual can know about a system in the light of what he or she 
already knows and expects, just as in standard Bayesian probability. 

The idea has been given something of a rough ride by physicists, 
who seem uncomfortable with QBism because they see it as a solipsis- 
tic view of the world. Perhaps Mermin’s advocacy will secure it a more 
sympathetic hearing. At any rate, it has the virtue of refusing to ignore 
quantum theory's long-standing tussle with the role of the observer. 

Aside from the merits of the idea, it is striking that Mermin should 
discuss it at all. Any view of quantum (or indeed classical) physics that 
borders on the metaphysical has long been out of fashion. Yet the early 
architects of quantum theory, such as Niels Bohr, Werner Heisenberg 
and Max Born, had no reservations about examining the philosophi- 
cal issues it raised, and the problem of ‘the Now troubled Einstein. 

Most famously, Bohr and Einstein argued about whether quantum 
mechanics allowed any room for the idea of realism — of an objective 
world that exists independently from our efforts to observe and meas- 
ure it. Bohr insisted that physics was concerned with what we can know, 
and was silent on the matter of ‘how things really are. He, Born and 
Heisenberg made claims about quantum theory’s challenge to causal- 
ity and determinism that today look like a bit of an intellectual stretch. 

Mermin is not alone in admitting such debates back into science. In 
a Perspective article on page 443, physicists Artur Ekert and Renato 
Renner place current work on quantum cryptography in a broader 
context that encompasses the thorny concept of free will. 

Quantum methods of encoding information, they argue, combined 
with “an arbitrarily small amount of free will are sufficient to conceal 
whatever we like”. To enable foolproof secrecy, “free will is our most 


Ts phrase “Shut up and calculate” is popularly associated with 


valuable asset”. This, too, is the kind of claim that a few decades ago 
would have risked being dismissed, if not ridiculed, as idle coffee-room 
chat. It reveals researchers’ new boldness for engaging with the mean- 
ings and corollaries of quantum physics. In doing so, they enrich the 
discussion as Bohr, Einstein and their colleagues did. 

But why now? Interest in the founda- 


“Researchers have _ tions of quantum theory — what it really 
anew boldness tells us about the character of the world 
forengaging with that we experience in reassuringly classi- 
the meanings cal terms — has flourished since the late 
and corollaries of 1980s. That has been driven partly by the 


development of experimental techniques, 
especially in quantum optics, that can test 
ideas about phenomena such as entanglement (the codependence of 
remote quantum states), measurement and wavefunction collapse, and 
which were previously accessible only to theoretical speculation. As a 
result, physicists can more clearly see the most fundamental features 
of quantum theory — in particular the nonlocality and contextuality 
(contingency on how results are obtained) of quantum systems. 

The other driver is an emphasis on quantum theory as a theory of 
information: of what we can know, transmit and share. This view has 
already thrown up practical applications such as quantum crypto- 
graphy and rudimentary quantum computers. But it has also reawak- 
ened long-deferred foundational questions in new guises. It shows 
us that Bohr and Einstein could already see the ramifications for the 
philosophy and epistemology of science. Lacking the experimental 
tools to make progress, they doubted that these issues could ever be 
much more than metaphysical. Now they can be, and it is right that 
scientists should have the confidence to raise them afresh. m 


quantum physics.” 


Wheat lag 


Growth in yields of the cereal must double if the 
Green Revolution is to be put back on track. 


heat is widely considered to be the world’s most important 
Wee and Norman Borlaug knew a thing or two about how 

to grow it. The US agronomist developed varieties that 
could better resist disease and gave higher yields. In doing so, he saved 
an estimated one billion people from starvation. 

This week marks a century since Borlaug’s birth, so what better time 
to consider why millions still go hungry, and to ponder how the next 
Green Revolution can be kick-started? Ata meeting in Mexico this week, 
organized by the International Maize and Wheat Improvement Center 
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(CIMMYT) in collaboration with the Association for Agricultural 
Research and Experimentation of the State of Sonora, researchers will 
look again at the prospects for wheat. Although wheat consumption is 
growing, the investment needed to build on Borlaug’s legacy is scarce. 

Wheat provides 20% of the dietary energy for the world’s popula- 
tion. Yet growth in yields has stagnated at around 0.9% per year over 
the past decade — by contrast, maize (corn) yields grow by almost 
double that at approximately 1.6% per year. To meet future demand 
for food, researchers say that wheat yields must grow by 1.7% each 
year. That will require investment. The total global spend on wheat 
breeding and research, around US$500 million per year, is currently 
one-quarter of that spent to improve maize. 

The discrepancy arises because seed companies can make higher 
profits from maize than from wheat. Maize is a hybrid crop that pro- 
duces seeds with poor yields, so there is little incentive for farmers 
to keep and replant them. To get the best results, they must buy new 
maize seeds each year. Seed producers get no such annual income 
from wheat because farmers can reap and replant seeds from several 
successive harvests without losing much yield. 

Existing solutions to this are as much economic as agronomic. One 
strategy is for plant breeders to collect royalties from farmers who save 
seeds — as is done in countries such as Australia and the United King- 
dom. Although this approach is reasonable for farmers who harvest 
substantial profits, it is less applicable to subsistence farmers in the 
developing world; for them, science might provide better solutions. 

An international research effort to boost wheat yields by 50% by 
2035 will be officially launched at this week’s meeting. The Interna- 
tional Wheat Yield Partnership — a consortium of research institu- 
tions including CIMMYT and the United Kingdom’s Biotechnology 
and Biological Sciences Research Council — is a long-term project 
that aims to raise US$100 million in the next five years. 

Researchers plan to use the cash to explore, for example, how to 
improve the way wheat photosynthesizes. Wheat, frankly, does it 
badly; it converts just 1% of incoming light to grain. Maize is three 
times more efficient; sugar cane eight times. 

The map of the wheat genome remains a job half-finished. An 


estimated US$50 million is needed to map the remaining 20 of wheat’s 
42 chromosomes. Obtaining the complete genome should accelerate 
the effort to make wheat more tolerant to heat and drought through 
both conventional breeding and genetic-modification techniques. 
Crucial work to monitor the spread of wheat pathogens must con- 
tinue. Borlaug championed this research through the Durable Rust 
Resistance in Wheat project led by Cornell University in Ithaca, New 
York, and the Borlaug Global Rust Initiative, 


“The total s' lobal a consortium of more than 20 institutions, 
sp endon wheat which has its secretariat at Cornell. Famously, 
breeding and he said that “rust never sleeps”, and the con- 
research is tinued spread of the devastating Ug99 stem- 
currently one- rust fungus in Africa and other new strains in 
quarter of that Ethiopia (see page 404) demonstrate the truth 
spenttoimprove — ofhis words. 

maize.” Today, yield is not the only impact by which 


agriculture is judged. The Intergovernmental 
Panel on Climate Change will next week report on farming’s direct and 
indirect consequences for the planet. The focus on the carbon footprint 
of food production is necessary, but the message cannot be boiled down 
to the misleading idea that all agricultural techniques bar organic farm- 
ing are a plague on the environment. 

Borlaug believed that increasing yields through the use of new and 
improved varieties, along with the responsible application of fertilizer 
and pesticides, could benefit natural ecosystems. Less land would need 
to be converted into agricultural production to grow food. Recent 
studies suggest that he was right (J. R. Stevenson et al. Proc. Natl Acad. 
Sci. USA 110, 8363-8368; 2013). 

Misguided opposition to some aspects of big agriculture, such as 
chemical-fertilizer use in the developing world, could stop many 
nations from growing the food they need. Objections should be saved 
for the irresponsible overuse of fertilizer in the West and Asia, which 
has led to widespread water and air pollution. 

One of Borlaug’s guiding principles was that food is the moral right 
of all who are born into this world. He won an important battle against 
hunger, but the war continues. = 


A parlous state 


The decay at ancient Pompeii is symbolic of a 
deeper malaise in Italy’s heritage. 


be remembered more for what we destroy than what we create”. 

For anyone who has been following the saga of ancient Pompeii 
over the past few years, that observation has the bite of reality. Report 
after report has appeared in the press detailing the collapse of this wall, 
the closure of that house or, in one extreme case in 2010, the destruc- 
tion of a whole building. 

Pompeii, which attracts some 2.3 million visitors a year, may be 
adept at grabbing the headlines, but the insidious creep of heritage 
erosion is far from limited to the Bay of Naples — in fact, it is a problem 
that encompasses the archaeological wealth of the globe. 

Heritage is ensnared in a particularly vicious catch-22. On the one 
hand, ruins are a key source of inspiration and income, as tourists flock 
to connect with the past. On the other, that throng so in thrall to history 
does untold damage merely by turning up to havea look. But even with- 
out hordes of tourists inflicting wear and tear on fragile remains, ruins 
face a determined and dangerous enemy in the shape of the elements. 

As soon as an artefact is unearthed, decay sets in. Objects and 
buildings that have lain preserved for centuries beneath the earth 
immediately begin to suffer when exposed to the atmosphere. A few 


Cr Palahniuk, the US author of Fight Club, noted that “we'll 


400 | NATURE | VOL 507 | 27 MARCH 2014 


kilometres down the road from Pompeii, the site of Herculaneum, 
which was buried by the same volcanic eruption in aD 79, is faring 
better than its more famous sibling. True, the exposed site is much 
smaller, but with a high number of multi-storey buildings and a lot of 
preserved organic matter, Herculaneum presents unique challenges. 
The site’s enhanced ability to hold decay at bay is largely down to a 
public-private venture launched in 2001 and the multidisciplinary 
approach that the ensuing project has taken. 

That success is an object lesson not only for Pompeii, which hopes to 
set up a similar project to help preserve the ruins (see page 411), but also 
for the malaise that grips Italy’s heritage as a whole. Hampered by debt, 
the country lacks the finances to reverse the decay afflicting its monu- 
ments. Instead, it must rely on external funds and philanthropists. The 
crumbling Colosseum in Rome, for example, is being repaired thanks 
to some €25 million (US$35 million) from billionaire Diego Della Valle. 
Yet this is still not enough. Just west of Rome, ancient Ostia is beset by 
weeds, and was even flooded earlier this year. 

Although structural repairs and conservation at any site require 
money and coordinated expertise, fending off many of the agents 
of erosion — by controlling vegetation, for example, or employing 
more guards to keep tourists in check — is relatively easy to achieve. 
Such steps would be important gains in this war of attrition. Finding 
ways to maximize the revenues coming into the sites and relaxing 
the austerity law stopping the employment of 
public-works personnel would be a start. For if 
preventable problems are not tackled urgently, 
then no amount of money from external sources 
will be able to reverse the tide of decay. m 
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That is good news, and enough to meet the United Nations 
Millennium Development Goal that called “to halt and begin 
to reverse the incidence” of TB. But the pace of progress is too slow. 
Some 8.6 million people contracted the disease in 2012, and 1.3 mil- 
lion died, including 320,000 people with HIV. As it stands, the world 
will miss the international target to eliminate TB by 2050. To meet that 
ambitious goal, we need to modernize the way in which we tackle the 
disease. This means that fundamental research must play a bigger part 
in nurturing the development of diagnostics, medicines and vaccines. 
Some clinicians in the TB field still do not recognize the power 
of basic research in developing tools for patients. Indeed, the slow 
pace of the research often frustrates health workers. In return, basic 
researchers may not see how their efforts can 
directly contribute to TB management. The 
divide between the two approaches sees them 
often considered at odds with each other. 

The past ten years have seen a remarkable rise 
in the number of research, financing, technical 
and programme partners involved in build- 
ing a major pipeline for TB tools. Innovative 
public-private partnerships and community 
activism have stimulated faster and more equi- 
table access to such tools. However, without con- 
tinued studies into the molecular nature of TB, 
no new interventions will become available to 
health-care professionals. At a time when health 
systems still miss about one-third of the new 
cases each year and as drug-resistant TB strains 
are signalling a fresh public-health crisis, basic 
research must be better harnessed and promoted 
to develop new diagnostic, therapeutic and preventative tools. 

To help catalyse this change, the World Health Organization (WHO) 
has made research one of the three fundamental pillars of its proposed 
post-2015 global TB strategy. This research pillar seeks to promote the 
discovery, development and rapid adoption of new tools for TB control. 
Never before has the WHO so clearly addressed the need for fundamen- 
tal research in the fight against TB, and it hopes that doing so now will 
encourage crucial communication across the community. Clinicians 
and public-health specialists need to help to identify areas in which 
fundamental research can provide answers, but they also need to learn 
the rhythms and language of basic research — the constraints, processes 
and timescales. In exchange, fundamental researchers must learn how to 
give their studies the most impact on clinical needs and disease burden. 

This will require fresh thinking on how to 


3 ewer people will contract tuberculosis (TB) this year than last. 


fund and manage cross-disciplinary science. DNATURE.COM 
We need to encourage researchers to assemble _ Discuss this article 
multidisciplinary teams to work on key public- _ online at: 

health research themes with defined outcomes. _g0.tlafure.com/it65i4 


DEVELOPMENT OF 


OPTIMAL 


TOOLS TO PREVENT, 
DETECT AND TREAT TB 


DEPENDS 


HEAVILY ON PUBLIC 
AND PRIVATE 


FUNDING. 


Fundamental research is 
the key to eliminating TB 


To speed up progress to tackle the disease, basic researchers and those on the 
‘front line must talk to each other more, says Christian Lienhardt. 


Longitudinal studies of patients and households can be used, for 
example, to identify biomarkers that characterize the various clini- 
cally relevant stages of TB, together with studies on how to prevent 
transmission and disease development. The power of this type of study 
has been illustrated in the Framingham Heart Study, established in 
1948 to identify the factors behind cardiovascular disease. 

A coordinated approach could help to prevent TB. Understanding the 
biology of Mycobacterium tuberculosis — the function of its intracellular 
machinery and the biochemical pathways critical for its survival and 
pathogenesis — is essential to combat the pathogen. More-sophisticated 
studies of the host immune response could help to identify biological 
mechanisms in people who become infected but never develop active 
TB, and might reveal how vaccines could mimic the immune response. 

In diagnosis, simple tests that use small quanti- 
ties of body fluids to detect infection would help 
to prevent progression to active disease. Rapid 
identification of drug resistance is needed to 
initiate appropriate, safe and effective treatment. 

In treatment, basic and translational research 
must focus on improving options for people 
with drug-sensitive and drug-resistant TB alike. 
These efforts should go beyond the development 
of better drug candidates and combinations and 
study why TB cannot be treated with just a few 
weeks of chemotherapy — as most bacterial 
infections can be. 

None of these studies, however, will be pos- 
sible without the close collaboration of patients, 
physicians and basic, translational and clinical 
researchers, to jointly guide the rational approach 
to identification of new diagnosis, treatment and 
preventive options. Broad, cross-discipline research spans a range of 
funding agencies and cannot be supported by individual funders. To 
facilitate the funding of cross-cutting work, projects could be designed 
on the basis of the disciplines involved, and scientists could seek funds 
from agencies that support work in these respective disciplines, such 
that multiple funders could ultimately be involved in larger projects. 

Crucially, the development of optimal tools to prevent, detect and 
treat TB depends heavily on public and private funding; the financing 
gap for TB research is now estimated at about US$1.4 billion per year. 
The WHOs objective in promoting this crucial dialogue and change is 
to bring together the clinical, public-health and research communities 
to promote and expand TB research and its funding. But the agency also 
hopes that fundamental researchers from many other scientific fields 
will consider applying their work to TB. The world needs them. m 


Christian Lienhardt is senior research adviser at the WHO Global 
TB Programme in Geneva, Switzerland. 
e-mail: lienhardtc@who.int 
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Aircraft spy on 
tropical storms 


Aircraft flying over tropical 
oceans could significantly 
improve the accuracy of 
storm forecasts by making 
high-resolution atmospheric 
measurements. 

Jennifer Haase at the Scripps 
Institution of Oceanography 
in La Jolla, California, and her 
team developed an airborne 
Global Positioning System 
device, which uses radio signals 
to sense and precisely locate 
atmospheric conditions. In 
2010, during research flights 
into the storm cell over the 
Caribbean Sea that developed 
into Hurricane Karl, the team 
found that their system agreed 
with data from spaceborne 
systems, and it increased the 
number of atmospheric profiles 
taken in the storm region by 
more than 50%. If deployed 
on commercial aircraft, the 
technique could produce huge 
amounts of data for use in 
meteorology, the authors note. 
Geophys. Res. Lett. http://doi. 
org/rzd (2014) 


How squid control 
their shine 


Squid control the 
dynamic iridescence 
of their skin 
separately from the 
way they control their 
pigmentary colours. 
Paloma 

Gonzalez-Bellido 
at the University of 
Cambridge, UK, 
Trevor Wardill at the 
Marine Biological 
Laboratory in 
Woods Hole, 
Massachusetts, 
and their 
colleagues show 
that iridescence 


Selections from the 
scientific literature 


GENETICS 


Loblolly takes genome size prize 


A species of pine tree native to the southeastern 
United States has a genome with 23 billion 
base pairs, more than 7 times the length of the 


human genome. 


Kristian Stevens at the University of California 
in Davis and his team sequenced and assembled 
the genome of the loblolly pine (Pinus taeda; 
pictured), mainly using the tissue from a single 


pine seed. 


in Atlantic longfin squid 
(Doryteuthis pealeii) is partly 
controlled outside the brain 
through the stellate ganglion. 
Severing input to this ganglion 
on one side of the body results 
in the loss of iridescence 

on that side (pictured) 

within 10 minutes, with the 
structures responsible for 

this iridescence becoming 
transparent. Both iridescence 
and the control of pigmentary 
colours require input from 
the brain through the pallial 
nerve, but the iridescence 
signal is routed through the 
stellate ganglion. By contrast, 
the signals for skin-pigment 
changes are controlled 
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Another team, made up of many of the 
same researchers and led by Jill Wegrzyn at the 
University of Connecticut in Storrs, characterized 


around 50,000 of the genes and estimated 


that 82% of the loblolly genome is made from 
repetitive elements. This work, the first pine 
genome assembled so far, provides a foundation 
to study the biology of conifers, the authors say. 


Genetics 196, 875-890; 891-909 (2014) 


separately through the 
fin nerve. 
J. Exp. Bio. 217, 850-858 (2014) 


Strong, stiff and 
shell-like ceramic 


A ceramic material created 
using ice crystals as a template 
has demonstrated a rare 
combination of high strength, 
toughness and stiffness. 
Sylvain Deville of the 
French National Centre 
for Scientific Research in 
Cavaillon and his colleagues 
made the material from 
alumina platelets, alumina 
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nanoparticles and a silica 
calcia glass. When they froze 
the material, ice crystals grew 
that caused the platelets to 
align into layers, trapping the 
nanoparticles between them. 
These layers can be pressed 
and sintered into pieces a few 
centimetres in diameter and a 
few millimetres thick. Under 
the microscope this material 
resembles nacre, which makes 
up many seashells, and is as 
strong and tough as many 
aluminium alloys used in 
engineering, but with higher 
hardness and stiffness, which 
it maintains up to 600°C. 
Nature Mat. http://doi.org/r29 
(2014) 
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Protein seed as 
Alzheimer’s test 


A test that exploits the 
abnormal stickiness of 
misfolded proteins could 
one day be used to diagnose 
Alzheimer’s disease, reports a 
team led by Claudio Soto at the 
University of Texas Medical 
School in Houston. 

The team adapted 
an amplification assay, 
originally developed for 
prion diseases. The new 
test used certain proteins as 
seeds that trigger clumping 
of malformed amyloid-f 
proteins, creating aggregates 
that are typical of the disease. 
When tested on patient 
samples from three centres, 
the assay distinguished 
the cerebrospinal fluid of 
patients with Alzheimer’s 
from that of people with 
other neurological diseases, 
with 90% sensitivity and 92% 
specificity. The researchers 
are investigating why the 
test did not work on samples 
from a fourth centre. 
Cell Rep. http://doi.org/r3b 
(2014) 


Writing with 
water and silk 


Electron beams can create 

very fine nano-structured 

patterns on thin films of silk, 

with only water required to 

complete the etching process. 
Fiorenzo Omenetto 

from Tufts University in 

Massachusetts and his 

team say that their silk- 

and-water system avoids 

the toxic chemicals and 


complex processing steps 
needed in other forms of 
lithography. The authors 
created photonic lattices 
(pictured), which manipulate 
light, with resolution as 

high as 30 nanometres. 

They also etched patterns 

out of silk films to which 
active materials, such 

as enzymes, proteins or 
quantum dots had been added. 
Nature Nanotechnol. http://doi. 
org/r3c (2014) 


Chickens’ Pacific 
voyaging detailed 


Suggestions that ancient 
Polynesians had contact 
with South America 
could have been biased by 
contaminated samples of 
ancient-chicken DNA, say 
Alan Cooper and Jeremy 
Austin at the University of 
Adelaide, Australia, and their 
colleagues. 

The authors sequenced 
genetic information from 
37 ancient chicken bones and 
124 modern samples, all from 
Polynesia and the islands of 
southeast Asia. They found 
a unique and distinctive set 
of DNA variations in all of 
the ancient and many of the 
modern specimens. This 
‘Polynesian motif’ is not found 
in early South American 
chickens, suggesting that 
humans transported chickens 
from Micronesia across the 
Pacific Ocean, but only as 
far as Easter Island (Rapa 
Nui). Contamination of some 
ancient samples with modern- 
chicken DNA probably 
explains previous suggestions 
that the migration continued 
all the way to South America, 
the authors say. 
Proc. Natl Acad. Sci. USA http:// 
doi.org/rx6 (2014) 


Early quasars ate 
like the rest 


The giant black hole in the 
most distant-known quasar, 
which formed just 750 million 
years after the Big Bang, 
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Climate change endangers culture 


> HIGHLY READ 
on www.lOP.org 


in March 


Many of the world’s most important cultural 
sites could find themselves below sea level 
owing to climate change. 


Using high-resolution topography and 
predictions of sea-level rise as a result of global warming, Ben 
Marzeion at the University of Innsbruck, Austria, and Anders 
Levermann of Potsdam University, Germany, calculated the 
temperature at which each cultural site on the World Heritage 
List would be affected by swelling oceans. The authors found 
that if current global mean temperatures were sustained for two 
millennia, around 40 sites, 6% of the total, would be damaged if 
left unprotected. If global temperatures warmed by 3 kelvins the 
number of sites would rise to 136, or 19% of the total. 


Environ. Res. Lett. 9,034001 (2014) 


engulfed matter at the same 
rate as much younger quasars. 
Alberto Moretti of the Brera 
Astronomical Observatory 
in Milan, Italy, and his 
colleagues used the European 
Space Agency’s XMM- 
Newton observatory to study 
ULAS J1120+0641, a galactic 
centre that produces huge 
amounts of radiation powered 
by a supermassive black hole. 
They found that its X-ray 
spectrum, an indication of 
the rate at which the black 
hole sucks in matter, was 
indistinguishable from those 
of quasars seen later in the 
life of the Universe. The 
authors had expected that 
ULAS J1120+0641 would 
collect matter ata much 
higher rate because its mass 
is 2 billion times that of the 
Sun. The slow growth rate 
of the galactic centre raises 
questions about how it could 
have reached its huge size so 
early in the Universe's life. 
Astron. Astrophys. 563, A46 
(2014) 


HIV vaccine 
success secrets 


The production of certain 
antibodies could explain 
the partial success of an 
HIV-vaccine trial and the 


failure of another. 

In 2009, researchers reported 
that an experimental HIV 
vaccine had reduced infection 
risk by 31%, the only HIV 
vaccine ever shown to be 
effective. Two teams have 
now compared the immune 
responses of people in that trial, 
known as RV 144, with those of 
participants in a different trial, 
VAX003, in which a vaccine did 
not prevent infection. Georgia 
Tomaras at Duke University 
in Durham, North Carolina, 
and her colleagues found that 
volunteers in the RV 144 trial 
produced a greater response, 
in terms of IgG3 antibodies, 
which recognize a portion of 
HIV’s outer shell, than people 
in the unsuccessful trial. 

Another team, led by Galit 
Alter at Massachusetts General 
Hospital, Boston, also found 
that the differences between the 
two vaccines may be explained 
by higher levels of IgG3, and by 
another antibody called IgG1, 
in the successful trial. These 
antibodies might have spurred 
other immune cells to vanquish 
cells infected with HIV, both 
teams conclude. 

Sci. Transl. Med. 6, 228ra38; 
228ra39 (2014) 
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SEVEN DAYS nescnnss 


Nuclear handover 


Japan will relinquish more 
than 300 kilograms of 
weapons-grade plutonium 
and around 200 kilograms 
of highly enriched uranium 
to the United States. The 
handover was announced 
on 24 March at the Nuclear 
Security Summit at The Hague 
in the Netherlands. Japan 
had kept the materials for 
use in an experimental Fast 
Critical Assembly reactor in 
Tokaimura. But the facility is 
currently not operating, and 
Japan’s stockpile has raised 
concerns about potential 
terrorist threats and nuclear- 
weapons ambitions. 


Climate-data site 


The US government launched 
a pilot website (http://climate. 
data.gov) on 19 March to 
publish and visualize open data 
on climate change — a move to 
increase public awareness and 
promote community planning 
around the issue. Operated by 
NASA and the US National 
Oceanic and Atmospheric 
Administration, the website 
will initially focus on coastal 
flooding and sea-level rise, 
using data from various federal 
agencies. It aims to produce a 
series of web-based tools for 
climate planning. 


Telescope trauma 
The budget for NASA’s Wide- 
Field Infrared Survey Telescope 
(WFIRST) could spiral out 

of control if the mission 
expands to include an inherited 
telescope and an instrument 

to hunt for extrasolar planets, 
concludes a report released on 
18 March by the US National 
Research Council. WFIRST is 
likely to be the agency’s next 
big astrophysics spacecraft 
after the James Webb Space 
Telescope launches in 2018. 


A Pa 


Wheat-disease warning 


Wheat farmers in East Africa and the Middle 
East must be vigilant for a pathogenic fungus 
that has wiped out more than 10,000 hectares of 
the crop (pictured) in southern Ethiopia since a 
2013 outbreak. The warning comes in a report to 
the Borlaug Summit on Wheat for Food Security 
in Mexico on 25-28 March. The strain causing 
stem-rust disease in Ethiopia — sub-Saharan 


Africa's largest wheat producer — is not the 
virulent Ug99 strain that is spreading across the 
globe, says the report from a coalition of research 
organizations, including the International Maize 
and Wheat Improvement Center in El Batan, 
Mexico. The disease could spread to countries 
such as Kenya and Uganda, where farmers are 
planting vulnerable varieties of wheat. 


NASA should expect to spend 
at least US$2.1 billion on it, 
says the report, even if officials 
repurpose a 2.4-metre 
telescope donated by the 
National Reconnaissance 
Office, the US agency 
responsible for intelligence 
satellites. 


Collection access 


US government agencies 

must produce new rules 

for maintaining scientific 
collections of everything 

from Moon rocks to dinosaur 
fossils. On 20 March, John 
Holdren, director of the White 
House Office of Science and 
Technology Policy, ordered 
each agency with a permanent 
collection to produce a draft 
management and access policy 
within six months. The goal is 
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to better preserve and increase 
access to artefacts including 
rocks, tissue specimens, seeds 
and other objects retained for 
scientific inquiry. 


Top-quark mass 
Physicists have produced the 
most precise measurement yet 
of the mass of the top quark 

— the heaviest fundamental 
particle — by combining 

data from four experiments 

at the world’s two biggest 
particle colliders. The new 
measurement, 173.34+0.76 
GeV c? (310.28 £1.36 10” 
kilograms), has a relative 
uncertainty of 0.44%. This 
compares with uncertainties of 
0.50% and 0.55%, respectively, 
in measurements made at 
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the (now closed) Tevatron at 
Fermilab near Batavia, Illinois, 
and at the Large Hadron 
Collider at CERN, Europe's 
particle-physics laboratory 
near Geneva, Switzerland. The 
result was posted on the arXiv 
preprint server on 18 March 
(The ATLAS, CDE, CMS and 
DO Collaborations http://arxiv. 
org/abs/1403.4427; 2014). 


Genome editing 
The Berkeley and San 
Francisco campuses of the 
University of California have 
announced a US$12-million 
project focused on the 
CRISPR/Cas9 gene-editing 
technology, the powerful 
technique that allows targeted 
rewriting of genomes (see 
Nature http://doi.org/ 

1z6; 2014). The Innovative 


DAVE HODSON, CIMMYT 


MARCIO JOSE SANCHEZ/AP 


SOURCE: MRC 


Genomics Initiative will 
receive $10 million from the 

Li Ka Shing Foundation in 
Hong Kong, with the rest made 
up by the two universities. 
Jennifer Doudna, the Berkeley- 
based executive director of the 
initiative, says that it will focus 
on developing the technology 
for human-health applications 
and creating a library of 
research resources so that 
others can use the technique. 


EVENTS 


Ebola outbreak 


Dozens of people have died 
in an outbreak of the Ebola 
virus in southern Guinea. The 
World Health Organization 
in Geneva said on 24 March 
that 86 suspected cases 
resulting in 59 deaths have 
been reported. The situation 
was changing rapidly as 
Nature went to press. The 
numbers are likely to change 
as new cases are identified 
and suspected infections are 
ruled out through laboratory 
testing. Possible Ebola cases 
in neighbouring Liberia and 
Sierra Leone are also being 
investigated. 


US drought 


There is no end in sight for 
the drought ravaging the 
southwestern United States, 
the US National Oceanic and 
Atmospheric Administration 
(NOAA) said on 20 March 

in its Spring Outlook report. 


TREND WATCH 


More than 2.2 million people 
in the United Kingdom are 


voluntarily taking part in large- 
scale population-cohort studies, 


making for an “unparalleled 
collection” of data on health 


California has experienced 

its warmest and third-driest 
winter on record, with water 
reserves depleted (pictured) 
and temperatures from 
December to February 
averaging 8.9°C — almost 
half a degree warmer than 

the previous record, set in 
1980-81. NOAA also said that 
rivers in half of the continental 
United States are at minor or 
moderate risk of flooding this 
spring. The southern Great 
Lakes regions are at greatest 
risk, the report says. 


Polio in Iraq 


Iraq last week reported its 
first case of polio in 14 years. 
A six-month-old boy, who 
had not been vaccinated 
against the disease, developed 
paralysis in February. The 
virus was also isolated from 
his asymptomatic three-year- 
old sister. The case represents 
the first regional spread of 
polio since an outbreak of the 
crippling disease began last 
October in Syria, where, as of 


20 March, 37 cases have been 
reported by the World Health 
Organization. 


Meningitis vaccine 
The United Kingdom could 
become the first country to 
routinely immunize infants 
with a broad-spectrum vaccine 
against meningitis B. The Joint 
Committee on Vaccination 
and Immunisation said on 

21 March that the vaccine 
Bexsero — developed by 
Novartis in Basel, Switzerland 
— should be used if it can be 
purchased at low cost. Last 
year, after the vaccine was 
approved in Europe, the UK 
committee had said that it was 
not worth its price. Novartis 
plans to seek approval for the 
vaccine in the United States 
later this year, after regulators 
approved its emergency use in 
2013 to quella meningitis B 
outbreak at universities. 


FUNDING 
Forest funding 


Countries must pay 

US$30 billion a year for 
tropical-forest conservation by 
2020 to halt deforestation and 
promote rural development, 
recommends a report from the 
United Nations Environment 
Programme. The report, 
published on 21 March, comes 
four months after the United 
Nations climate summit in 


YOUNG MEN NEEDED IN UK COHORT STUDIES 


3.5% of the UK population takes part in medical cohort studies, 


but few young men are participating. 


and well-being, according to a 
review of 34 studies by the UK 
Medical Research Council. The 
earliest cohort study mentioned 
in the review, which was 
published on 21 March, began 
in 1946 and is still running. 
However, men aged 20-40 

are underrepresented in such 
studies (see chart). 
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Data from 26 cohort studies. The Million Women Study (1.36 million women over 50) and UK 
Biobank (500,000 men and women aged 45-69) are not shown: they eclipse all other studies. 


SEVEN DAYS | THIS WEEK | 


30 MARCH 

As part of its fifth 
assessment, the 
Intergovernmental Panel 
on Climate Change 
will publish a report 
on the vulnerability 
and impacts of climate 
change on human and 
natural systems. 
go.nature.com/cu4bhk 


27 MARCH 
Southeast Asia will 

be declared polio free 
by the World Health 
Organization. The last 
wild polio case was 
reported from India on 
13 January 2011. 
go.nature.com/Suxoar 


Warsaw, where nations agreed 
on the REDD+ scheme to 
make carbon payments to 
developing countries that 
reduce deforestation and forest 
degradation (see go.nature. 
com/pfswg2). Just $6.27 billion 
has been promised to the 
scheme so far. 


Turing centre 

In its budget report on 

19 March, the UK government 
announced a £42-million 
(US$69-million) ‘big data 
institute, to be named after 
British computer scientist 
Alan Turing. The government 
will also spend £55 million on 
a centre to manufacture cell 
therapies for late-stage clinical 
trials, £19 million on a centre 
to help to commercialize the 
carbon-monolayer material 
graphene, and £106 million 
on PhD-training centres. The 
funding will be spread over 
five years. But researchers 
remain disappointed that 
funding for basic science is 
frozen at £4.6 billion per year 
for the next two years, the 
same level it has been at since 
2010. See go.nature.com/fefnls 
for more. 


> NATURE.COM 
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Marijuana for use in US research is grown at only one federally approved facility, at the University of Mississippi in Oxford. 


Federal red tape ties up 
marijuana research 


Despite drug’s legalization in two US states, biomedical science faces continued restrictions. 


BY HELEN SHEN 


idden in a locked room on New 
He University’s campus in lower 
Manhattan, a half-tonne steel safe 
guards hundreds of vials of a drug extracted 
from marijuana. To the US government, the 
drug — Epidiolex — is one of the world’s most 
dangerous substances, ripe for abuse. But to 
Orrin Devinsky, a neurologist at the univer- 
sity, itis a potential treatment for tremors in 
children with severe epilepsy. 
Although Epidiolex, made by GW 


Pharmaceuticals in Salisbury, UK, is not 
psychoactive, it took Devinsky six frustrating 
months to secure local and national approval 
for a clinical trial that began in January. “Ifsome 
teenagers broke in, theyd get nothing out of it? 
he says. The physical and regulatory barriers 
that nevertheless surround Epidiolex point to 
the stigma that continues to stymie research on 
marijuana and its derivatives, even in US states 
that have moved to legalize its sale and use, such 
as Colorado and Washington (see ‘Slow burr). 

Colorado's legal marijuana sales began in Jan- 
uary and are booming. The state’ legislators are 


weighing a proposal to provide US$7 million to 
study the drug's therapeutic potential. But even 
if the measure is enacted, Colorado research- 
ers might not be permitted to use locally grown 
plants for their studies, and would need to 
clear several levels of government approval 
before running any clinical trials. The US Drug 
Enforcement Agency (DEA) still classifies mari- 
juana alongside heroin and LSD asa substance 
with “no currently accepted medical use and a 
high potential for abuse”. 

“It’s a terrible, terrible paradox,” says 
Devinsky. “The federal government is 
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SLOW BURN 


Even in states where 
marijuana is legal for 
adults to use without 
restrictions, federal 

rules hamper research 
on the drug. 


Washington 


a Marijuana legal 
for adult use 


Hawaii 


> severely limiting the research that we can do?” 

Currently, the US National Institute on 
Drug Abuse (NIDA) runs the nation’s only 
federally sanctioned marijuana farm, pro- 
ducing a research supply through a contract 
with the University of Mississippi in Oxford. 
To obtain the strain, researchers — even those 
with private or local funding — must gain 
approval from the Department of Health 
and Human Services (HHS) or the National 
Institutes of Health, as well as the Food and 
Drug Administration (FDA). The DEA and 
local drug-enforcement officials must also 
approve research facilities for secure storage 
and handling of marijuana. The approval 
process can last for years. 

Suzanne Sisley, a clinical psychiatrist at the 
University of Arizona College of Medicine in 
Phoenix, knows this well. In 2011, she secured 
approval from the FDA to conduct a trial of 
marijuana to treat post-traumatic stress dis- 
order in US military veterans. But it was not 
until two weeks ago — nearly three years later 
— that she received clearance from the HHS. 
Although Sisley already has plenty of volun- 
teers for her 70-person trial, her laboratory 
will have to pass inspections by drug officials 
before it can receive limited quantities of the 
only federally approved marijuana strain. 

That variety contains relatively high levels 
of tetrahydrocannabinol (THC), marijuana’s 
primary active ingredient, and low levels of 
cannabidiol (CBD), another potentially thera- 
peutic component. Such controlled cultiva- 
tion helps to maintain chemical consistency 


> 


MORE 
ONLINE 


laws in effect 


*States affected: Alaska, Arizona, California, Connecticut, Delaware, Hawaii, Illinois, Maine, Maryland, Massachusetts, 
Michigan, Montana, Nevada, New Hampshire, New Jersey, New Mexico, Oregon, Rhode Island, Vermont. 


in research marijuana, explains Mahmoud 
ElSohly, who directs the NIDA Marijuana 
Project at the University of Mississippi. 

Still, he anticipates imminent research 
demand for more plant varieties, especially 
some containing higher concentrations of 
CBD. Emerging data suggest that the chemi- 
cal exhibits anticonvulsant, anti-inflammatory 
and pain-relieving effects, while lacking — or 
perhaps even countering — the psychoactive 
effects of THC’*. A few highly publicized 
anecdotes involving the use of marijuana 

products for treating 


“The federal severe epilepsy in 
government is children have also 
severely limiting _ stoked intense public 
the research interest in high-CBD 


marijuana. As reports 
surface of families 
moving to Colorado to gain access to CBD for 
their children, Devinsky and other researchers 
see an urgent need to accelerate scientific inves- 
tigations of the substance. 

ElSohly has already prepared one variety of 
marijuana that contains equal amounts of CBD 
and THC, and is hoping to grow a high-CBD, 
low-THC variety later this year. “With all the 
publicity going on right now, I’m sure some- 
body is going to request that,” he says. 

But the diversity of recreational marijuana 
strains flooding the Colorado marketplace — 
such as ‘lemon skunk, ‘golden goat’ and “bubba 
kush’ — will not help to fill the research void. 
In a memo earlier this month, the University 
of Colorado reminded its faculty members 


that we can do.” 


to pursue studies only through federally 
approved channels, although it acknowledged 
that the NIDA supply may not be representa- 
tive of many marijuana strains available to peo- 
ple in the state. “As a university that receives 
federal financial aid and federal research 
funding, we have to abide by federal laws,” says 
Ken McConnellogue, a university spokesman. 

One field that does seem likely to flourish 
in Colorado and Washington is public-health 
research. Officials in both states are preparing 
surveys and studies to track long-term rates of 
marijuana use as accessibility and social accept- 
ance of the drug increase, and as large-scale data 
become available for the first time. “We have 
more people using marijuana, and people are 
more willing to tell you that they're using it,’ says 
Laura Borgelt, a clinical pharmacist at the Uni- 
versity of Colorado in Aurora, who is interested 
in studying the effects of maternal marijuana 
use during pregnancy on child development. 

Other topics of interest include possible 
changes associated with marijuana in the use of 
alcohol and other drugs, trends in car accidents 
and risky behaviour, and school performance. 
Last year, NIDA awarded more than $1 million 
in supplemental funding to five groups to study 
some of these questions in the context of the 
natural epidemiological experiments under 
way in Colorado and Washington. 

Few data exist to predict the effects oflegal- 
izing the drug’s use and sale. Studies of the 
‘non-enforcement’ cannabis system in the 
Netherlands by Robert MacCoun, a social 
psychologist at the University of California, 
Berkeley, suggest that marijuana consumption 
rose in the 1980s, as marijuana shops became 
widespread’. Overall, however, rates of mari- 
juana use in the Netherlands are comparable 
to those in other European countries. 

Cultural differences make it difficult to 
extrapolate those findings to the United States, 
warns MacCoun. His group has begun to study 
patterns of drug use in Washington, and he is 
looking forward to seeing data from Colorado 
researchers.“I'm watching quite eagerly to see 
what we can learn.” m 
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The dust-covered Opportunity rover aims to travel 3 more kilometres on Mars to reach clay deposits. 


PLANETARY SCIENCE 


NASA missions 
bid for extensions 


Scientists must show projects’ potential to secure funding. 


BY ALEXANDRA WITZE 


fter spending 10 years and driving 
Ams 39 kilometres on Mars, is there 

anything left for NASA’s Opportunity 
rover to do? 

Mission scientists think so. They say that 
Opportunity is on the verge of making some 
of its biggest discoveries yet. Just 3 kilometres 
south of the rover’s current position, satellites 
have spied a lode of water-bearing clay miner- 
als — a potentially habitable environment from 
atime when Mars was warmer and wetter than 
it is today. Matthew Golombek, Opportunity’s 
project scientist at the Jet Propulsion Labora- 
tory (JPL) in Pasadena, California, says that the 
rover is on the verge of hitting the “jackpot”. 

But, like six other ongoing NASA missions 
studying the Moon, Mars and Saturn (see 
‘Fighting for life’), Opportunity’s money is 
due to run out at the end of the US fiscal year, 


on 30 September. Managers for each mission 
are trying to convince the agency to cough 
up continued funding, and their arguments 
are due on 11 April. A ‘senior review’ panel of 
external planetary scientists will rank the pro- 
posals’ potential science return, and submit 
their suggestions to NASA headquarters for a 
final decision. 

Similar reviews take place every couple 
of years, but the upcoming one is especially 
fraught. NASA will almost certainly continue 
to fund its flagship rover, Curiosity, which 
landed on Mars less than two years ago. That 
leaves the older missions to battle it out for the 
rest of a limited pot of money. 

Jim Green, who heads NASA’s US$1.3- 
billion planetary-science division, says that 
the agency is working to find money for all of 
its operating spacecraft, and wants to dispel 
the idea that missions are battling to the death. 
“We'll go through the senior review, order 


IN FOCUS | NEWS 


missions and fund them,” he told attendees at 
the Lunar and Planetary Science Conference 
last week in The Woodlands, Texas. 

But hard choices often have to be made, says 
Nadine Barlow, a Mars researcher at Northern 
Arizona University in Flagstaff who served on 
the 2012 senior review panel. “You always have 
to keep in the back of your mind that this could 
be the end of the mission,” she says. 

For this year’s senior review, Green asked 
project scientists to take a slightly different 
approach from before. Each mission is to out- 
line scenarios for three different price tags: an 
ordinary operating plan; a larger budget that 
could cover extra science targets; and a finan- 
cial floor beneath which little science can real- 
istically be done. 

Above all, each mission needs to show what 
fresh science it has to offer. That can be chal- 
lenging when a spacecraft has already been 
exploring for years. 

One approach is to argue for studying plan- 
etary change over time. The Lunar Recon- 
naissance Orbiter (LRO) has been snapping 
high-resolution pictures of the Moon since 
2009, providing a long time series for studying 
surface changes. At the planetary conference, 
Mark Robinson of Arizona State University in 
Tempe showed off dramatic pictures of high- 
speed meteorite impacts on the Moon. On 
17 March 2013, for instance, automated cam- 
eras on Earth captured the brightest flash ever 
detected from one of these impacts. Soon after, 
the LRO photographed the resulting crater. 

The spacecraft has found many more exam- 
ples of fresh impacts. Surprisingly, the craters 
show debris spread over greater distances than 
might be expected. That observation could be 
important for future lunar exploration: astro- 
nauts might find themselves sprayed by rocks 
from far-flung impacts. 

The Cassini mission to Saturn also wants to 
examine temporal changes — in its case, the 
shifting seasons. Cassini arrived in 2004, just 
as Saturn's northern hemisphere was begin- 
ning to emerge from winter. “We're coming 
onto northern summer now, and were excited 
about the Sun climbing to the highest extent 
that we've seen,’ says project scientist Linda 
Spilker of JPL. 

That new Sun angle will give Cassini fresh 
views of Saturn's rings and of the giant hexago- 
nal jet stream around the planet’s north pole. 
On the planet’s largest moon, Titan, which 
harbours northern polar seas made of meth- 
ane and other hydrocarbons, the oncoming 
summer may change wind patterns and ruf- 
fle the oceans’ surfaces. Researchers studying 
the moon last week reported that they may 
have spotted the first waves outside Earth (see 
Nature http://doi.org/127; 2014); these epi- 
sodes could occur more frequently as summer 
approaches. Three close flybys of another of 
Saturn’s moons, Enceladus, are also planned 
for Cassini, to sample the geyser-like plumes 
of water vapour that spray out of fractures > 
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FIGHTING FOR LIFE 


Seven NASA planetary missions are competing 
against each other for extensions. 


Lunar Reconnaissance Orbiter 
Launched: 2009 
Annual operating cost*: 


Moon $8.1 million 


Mars Odyssey 
Launched: 2001 
Cost: $12.3 million 


Opportunity rover 
Launched: 2003 
Cost: $13.2 million 


Mars 


Mars Express 

Launched: 2003, by the 
European Space Agency. 
Cost (to NASA): $3 million 


Mars Reconnaissance Orbiter 
Launched: 2005 
Cost: $29.5 million 


Curiosity rover 
Launched: 2011 
Cost: $59.4 million 


Cassini 
Launched: 1997 
Cost: $65.7 million 


Saturn * Latest year available 


> near its south pole. 

Smaller missions are also angling for a 
piece of the division's pie. Those running 
the Mars Odyssey spacecraft, which was 
launched in 2001 and is the longest-lived 
Martian spacecraft ever, are moving the 
craft into a different orbit. That would 
allow it to look at the red planet from dusk 
and dawn angles that have not been tried 
since the 1970s. The Odyssey team is also 
trying to play up the fact that the orbiter 
is an essential communications tool: it is 
in a favourable orbit for relaying informa- 
tion from rovers on the Martian surface to 
Earth. The newer Mars Reconnaissance 
Orbiter also acts as a relay, but that mission 
has suffered several 
glitches in recent 


“You always 
years. h 
ave to keep 
For now, all the .  . jue 
planetary missions ra idb 
under review are [Mls couldbe 
running as planned the end of the 


1eQ) ” 
until the end of this ™USSton. 


fiscal year. Mem- 

bers of the review panel should be named 
shortly, and are due to meet in May and 
report their recommendations to Green, 
for his decision in June. 

Stephen Mackwell, head of the Lunar 
and Planetary Institute in Houston, Texas, 
thinks that NASA officials will find a way 
to keep most of the missions going, espe- 
cially if the project teams can show a way 
to take the science in a new direction. 
“Business as usual is not a sustainable 
approach,’ he says. = 
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Software patents 
await legal fate 


Supreme Court ruling could affect medical-diagnostics firms. 


BY HEIDI LEDFORD 


the US Supreme Court is tackling one of 

the most vexing questions in patent law: 
can software be patented? Its decision could 
affect the future of personalized medicine. 

On 31 March, the court will hear arguments 
in Alice Corporation v. CLS Bank International, 
a lawsuit over four patents on software intended 
to reduce risk in financial transactions. The 
case is the latest example of Supreme Court 
efforts to rein in the US Patent and Trademark 
Office, which is notoriously permissive in 
granting commercial protection to inventions. 
Two years ago, the court struck down a class of 
patent often used to protect medical diagnostic 
tests. Last year, it ended a 30-year tradition of 
issuing patents on naturally occurring genes. 

With the latest case, the court has the poten- 
tial to restrict software patents, with repercus- 
sions not only for the technology industry, but 
also, yet again, for medical-diagnostics compa- 
nies, says Jonathan Masur, deputy dean of the 
University of Chicago Law School in Illinois, 
who specializes in intellectual property. “It’s 
enormously important,’ he says. 

Software patents are controversial. Critics 
charge that they tend to be vague and cover 
obvious inventions, and that they hinder inno- 
vation. They are magnets for ‘patent trolls’ — 
businesses that collect broad patents mainly to 
force others to pay licensing fees. Some critics 
also argue that computer programs are simply 
the expression of mathematical formulae, and 
thus can be considered abstract ideas, which, 
along with laws of nature, are not patentable. 

Yet in 1981, the Supreme Court conceded 
that some software could be patented, and since 
then the number of software patents has soared. 

The Alice case began in 2007, when CLS Bank 
International, based in New York, sued Alice 
Corporation of Melbourne, Australia, charging 
that four of Alice’s patents were invalid. Alice 
countersued CLS Bank for infringing the same 
patents. But CLS Bank could be in a good posi- 
tion, if the Supreme Court’ interest in limiting 
patentability in recent years is any indication 
(see ‘Valid concerns’). Observers are hoping that 
the court will use the case to develop concrete 
criteria to determine which software patents are 
valid. “The problem with the law to date is that 
it's very subjective,’ says Robert Sachs, a partner 


Wes the first time in more than 30 years, 
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VALID CONCERNS 


The US Supreme Court (pictured) 
is clamping down on the kinds of 
inventions that can be patented. 


re \, 
= 


NOVEMBER 1972 
Benson. Ruled that computer programs 
cannot be patented if they merely 
perform mathematical calculations. 


Gottschalk v. 


VTE EES) Diamond v. Diehr. 
Ruled that a device controlled by a 
computer program was patentable — 
reopening the door to software patents. 


VN Aue KSR International 
v. Teleflex. Threw out a patent on an 
adjustable accelerator-pedal system for 
being too obvious. 


SLU ara Bilski v. Kappos. Struck 


down a business-methods patent, 
saying it was based on an abstract idea. 


| MARCH 2012 | Mayo Collaborative 


Services v. Prometheus Laboratories. 
Said two medical-diagnostics patents 
were based on laws of nature, and invalid. 


SUT a4 RE Association for 
Molecular Pathology v. Myriad Genetics. 
Ruled that natural genes cannot be 
patented. 


| 2014 | Alice Corporation v. CLS Bank 
International. Arguments heard on 
software patents. 


JONATHAN ERNST/REUTERS/CORBIS 


CIRO DE LUCA/REUTERS/CORBIS 


at the law firm Fenwick and West in San 
Francisco, California. “Every judge gets to 
decide: ‘Is this the kind of thing that I think 
a patent claim ought to cover?” 

To bolster its arguments, CLS Bank is 
citing a 2012 case, Mayo Collaborative Ser- 
vices v. Prometheus Laboratories, that struck 
down two medical-diagnostics patents 
and sent tremors through the diagnostics 
industry. The patents covered the act of 
measuring metabolites to determine the 
proper dosage of certain drugs used to treat 
autoimmune diseases. The nine judges on 
the Supreme Court unanimously decided 
that the patents merely laid claim to a law 
of nature — the body’s breakdown of the 
drugs — and were therefore invalid. 

The Mayo ruling represented a funda- 
mental change, says Brian Dorn, a patent 
attorney at the Barnes & Thornburg law 
firm in Minneapolis, Minnesota, who counts 
diagnostics companies among his clients. 
“Tt’s been hard to get around,’ he says. “We've 
been getting patent rejections based on that 
case.’ Some even blame the court’s decision 
for a drop in US venture-capital investment 
in medical-diagnostics companies — from 
US$395 million in 2011 to $278 million in 
2013, according to London-based consul- 
tancy firm PricewaterhouseCoopers. 

Mayo is also setting a precedent for lower 
courts. Last October it was used, along with 
the 2013 Association for Molecular Pathology 
v. Myriad Genetics ruling that invalidated 
patents on genes (see Nature 498, 281-282; 
2013), to throw out key patents on a promi- 
nent non-invasive prenatal test for Down's 
syndrome, held by the diagnostics company 
Sequenom of San Diego, California. The 
ruling, by a California district court, used a 
strict interpretation of Mayo, says Christo- 
pher Holman, a law professor at the Univer- 
sity of Missouri-Kansas City Law School. “If 
that is how the courts are going to use that 
case, it’s a very bad sign for industry,” he says. 

The Alice case could be a chance for 
the Supreme Court to clarify how Mayo 
should be interpreted, says Masur: rulings 
on abstract ideas (as is likely to arise from 
Alice) are often applied to arguments about 
patenting laws of nature (as in Mayo). If the 
Supreme Court invalidates many kinds of 
software patents, for example, diagnostics 
firms could be in for more trouble, he says. 

The Supreme Court's tendency towards 
restricting patents is a concern for Greg- 
ory Graff, an agricultural economist who 
studies intellectual property at Colorado 
State University in Fort Collins. He argues 
that the US patent system should not be 
reformed through a series of radical court 
decisions. Instead, he would prefer to see 
legislation and patent-office procedures 
tweaked as fields of technology evolve. “You 
don’t need the nuclear option of blowing a 
hole in what’s deemed patentable,” he says. m 
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Conservation work under the Great Pompeii Project began at the site in 2013. 


ARCHAEOLOGY 


Sister city inspires 
Pompeli rescue 


Archaeologists hope that funding and conservation boost 


can halt decay of ancient city. 


BY DECLAN BUTLER 


ncient Pompeii, which suffered three 
A= collapses this month, may look to 

its smaller sister city Herculaneum for 
tips to ensure its survival. Both cities, located 
on Italy’s Neapolitan coast, were simultane- 
ously destroyed and preserved by the eruption 
of Mount Vesuvius in AD 79, but the historic site 
of Herculaneum is faring much better, thanks to 
a sustainable conservation project. 

Now an international research consortium 
is planning a similar pilot scheme for Pompeii. 
And researchers and officials are hoping that 
a €105-million (US$145-million) European 
Union project to restore Pompeii will also draw 
on lessons from Herculaneum. 

The rescue plan cannot come soon enough. 
“A lot of buildings are at risk [of collapse],” one 
anonymous observer close to the restoration 
efforts told Nature, adding that, as well as the 
repair effort, “they need a system to maintain 
the site so that it doesn’t happen again’. 

Excavations of Pompeii, which, with Her- 
culaneum, is a World Heritage Site of the 
United Nations Educational, Scientific and 
Cultural Organization, began in the mid- 
eighteenth century. But conservation took off 


only in the early twentieth century, by which 
time much of the city had been reduced to 
walls. However, decay at the 66-hectare site 
has accelerated since the 1960s as a result of 
haphazard funding and the disbanding of 
in-house maintenance crews. 

This month’ collapses are the latest in a series 
of incidents since the 1980s. Water is the site's 
biggest enemy: unexcavated embankments 
absorb rainfall and exert pressure on adjoining 
structures, and poor drainage allows surface 
water to accumulate. A lack of regular repair 
work means that rain also seeps into the walls. 

Much of Pompeii’s masonry consists of 
rubble stonework, which is highly vulner- 
able. Typically, this consists of two wall cover- 
ings assembled with roughly placed stones of 
volcanic tuff (opus incertum), together with a 
core of smaller stones set in a lime-mortar grout 
(opus caementicium). 

Such constructions show excellent longev- 
ity if kept dry. But water infiltration has led to 
repeated wetting and drying, causing stones to 
chip and flake, and the mortar to decay and lose 
cohesion. In addition, many twentieth-century 
conservation interventions at Pompeii have 
since proved inappropriate and damaging. 

The authorities are responding, however. > 
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> In February 2013, the European Union and 
the Italian government launched an emergency 
€105-million project to reverse the decades of 
neglect. Dubbed the Great Pompeii Project, it 
focuses on draining and securing embankments, 
restoring masonry and decorated surfaces, and 
protecting buildings from weathering. But the 
project, the budget for which must be spent by 
the end of 2015, has been beleaguered by con- 
cerns that delays and bureaucracy may affect the 
quality of the work. In response, the European 
Commission said on 6 March that it will carry 
out an urgent public review of progress. 

Johannes Hahn, the European commissioner 
for regional policy, says that the review’s aim is 
clear. “We have professionals working on the 
site. The most helpful thing we can do for them 
is to remove the bureaucratic barriers and let 
them get down to work” 

Massimo Osanna, appointed this month 
as head of the Special Superintendency for 
the Archaeological Heritage of Naples and 
Pompeii, which is in charge of the site’s conser- 
vation, says that although a longer time frame 
would be preferable, criticism of the pace of 
progress is unfair. High-quality archaeologi- 
cal conservation of an open-air site as vast as 
Pompeii inevitably has long lead times, he 
argues, and plans for using the money are 
already well under way. 

Having a proper plan for the sustainable 
management of Pompeii is also a major issue. 
The outsourcing of routine maintenance to con- 
tractors since the 1980s has had severe effects, 


UNDER A SHADOW 


Pompeii and Herculaneum were simultaneously 
destroyed and preserved by the eruption of 
Mount Vesuvius in AD 79. 
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with repeated delays to urgent repairs, says 
Andrew Wallace-Hadrill, a social and cultural 
historian at the University of Cambridge, UK. 

Wallace-Hadrill heads what many see as a 
model for conserving Pompeii: the Hercula- 
neum Conservation Project (HCP). This was 
launched in 2001, when Herculaneum was ina 
dire state, as a public-private venture between 
the philanthropic Packard Humanities Institute 
of Los Altos, California, the British School at 
Rome and the superintendency. In just over a 
decade, and having spent €20 million in total, 
it has helped to reverse the decay and put the 
town ona sustainable conservation footing (see 
‘Under a shadow’). 

“Tm impressed; Herculaneum wasa very sad, 
dirty and abandoned place; now it’s fantastic,” 


says Albrecht Matthaei, an archaeologist at the 
Roma Tre University. Although under the juris- 
diction of the superintendency, the HCP’s level 
of autonomy frees it from state bureaucracy, 
notes Wallace-Hadrill. 

The HCP also brings together archae- 
ologists, engineers and other scientists with 
conservators. This, together with an empha- 
sis on routine maintenance, has been key, says 
Wallace-Hadrill. And the project differs from 
the typical conservation practice of carrying 
out complete restorations on a house-by-house 
basis by focusing on urgent site-wide restora- 
tions and repairs. 

Now, a proposed project, the Pompeii Sus- 
tainable Preservation Project (PSPP), is taking 
inspiration from the HCP. An international 
consortium of research institutions, including 
the Fraunhofer research organization and the 
Technical University of Munich in Germany, 
hopes to raise US$10 million in philanthropic 
funding to launch a ten-year pilot project later 
this year that will study long-term, sustainable 
conservation and restoration approaches at 
Pompeii. “HCP is best practice,” says Matthaei, a 
coordinator of the PSPP. The new project seems 
laudable, but until work begins it is difficult to 
assess, adds Wallace-Hadrill. 

Osanna says that Pompeii’s greater size makes 
comparisons with the HCP difficult. But he is 
confident that the Great Pompeii Project can fix 
the city. “People need to give the project time to 
get up to speed, after which any controversy will 
hopefully be put to rest,” he says. m 


Chile puts plan for science 
ministry on hold 


Government rethinks former president’s bid to create first unified science department. 


BY MICHELE CATANZARO 


r | The new government of Chile has shelved 
plans to create the country’s first science 
ministry, disappointing thousands of 

researchers who had campaigned for it. 

Former president Sebastian Pifera submit- 
ted plans for the ministry to the Senate on 

7 March — days before leaving office — in 

response to long-standing demands by sev- 

eral Chilean scientific organizations. The plans 
were part of a last-minute legislative package 
that included three other, unrelated laws. 

But now, politicians supporting new presi- 
dent Michelle Bachelet, who was sworn in on 

11 March, have derailed the proposal. Senator 


Jaime Quintana, spokesman for the centre-left 
New Majority coalition that backs Bachelet, 
said that the draft laws “are not legitimate” 
because to enact them would require money 
that the current government intends to allocate 
to its own priorities. 

“Tt is a regrettable situation: it was a hard 
fight until we got the draft law, and now we 
feel a big discomfort,” says Eduardo Bravo, 
director of the National Association of Post- 
graduate Researchers in Santiago, which was 
among the organizations that had called for 
the science ministry. 

Chile is a small but highly productive player 
in Latin American science. In 2010, it spent 
US$1.15 billion on research and development. 
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Although that is equivalent to just 0.44% of its 
gross domestic product, or 2.8% of Latin Amer- 
ica’s total investment in science, the country 
ranks behind only Brazil, Mexico and Argentina 
in terms of the number of articles published. 

However, science governance in Chile is 
currently spread across the ministries of edu- 
cation, economy and agriculture. “This has 
resulted in multipolar governance that has not 
been capable of establishing a clear agenda of 
strategies and priorities, or strong networks 
of scientists,” says Jesus Sebastian Audina, 
a researcher in science policy at the Spanish 
National Research Council in Madrid. 

Jorge Babul, president of the Chilean Coun- 
cil of Scientific Societies in Santiago, adds: 
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Current Chilean President Michelle Bachelet has her own plans for science governance. 


“Putting science in a ministry not dedicated to 
it prevents the minister from spending enough 
time on research policy.’ He thinks that a single 
unified ministry would ensure “a big leap” in 
the ability to transfer knowledge within Chile. 
Bravo says that a dedicated institution would 
“have more financial independence”. 

In April 2013, after years of campaigning, 
a group of organizations — including sev- 
eral scientific academies and societies and 
Santiago-based pro-science lobby group the 
More Science Foundation — presented Pifiera 
with an open letter asking that science policy 
be unified in a single ministry. The petition 
had 5,000 signatories. 

Pifiera responded by creating a working 
group of academics and politicians, led by engi- 
neer and entrepreneur Bruno Philippi, to evalu- 
ate Chilean scientific governance and propose 
changes. Ina report released last May, the group 
urged the government to create the ministry. 

Pifiera’s 7 March proposal to the Senate was 
strongly based on that report. According to 
his draft submission, the ministry would have 
had two sub-secretariats: one for higher educa- 
tion and another for science, technology and 
innovation. Furthermore, Chile’s main 
science-funding body, the National Commis- 
sion for Scientific and Technological Research 
(CONICYT), would regain a role in advising 
the president on science policy that was taken 
from it by the authoritarian regime of Gen- 
eral Augusto Pinochet in 1973. CONICYT’s 
resumption of this role would ensure that the 
president is advised by a dedicated science- 
governance body. 

Most of Pifiera’s proposal is widely sup- 
ported by the Chilean scientific community, 
says Sergio Hojman, a physicist at the Univer- 
sity of Chile in Santiago and a member of the 
working group headed by Philippi. But some 
scientists are sceptical about the feasibility and 


benefits of the plan. “Building a ministry that 
works may take decades in Chile,” says Mario 
Duran Toro, an engineer at the Pontifical 
Catholic University of Chile in Santiago. “It 
would be enough to re-establish the role that 
CONICYT had before 1974: that would be 
much more effective and quick, he says. 

As well as potentially taking funds away 
from the government’s own priorities, the 
proposed ministry conflicts with the coali- 
tion’s plans: Bachelet’s electoral manifesto put 

science in a sub-sec- 


“Building a retariat of the econ- 
ministry that omy ministry, not a 
works may ministry of its own. 
takedecadesin still, Hojman thinks 
Chile.” that Bachelet is not 


necessarily opposed 
to the science ministry in principle, given that 
several members of the Philippi commission 
are supporters of government coalition parties. 
The poor timing of the proposal may also 
have affected its chances. “Presenting it at the 
last minute is something that was going to be 
seen with a bad eye by the new government,’ 
says Carolina Mufioz, a chemist and executive 
director of the More Science Foundation. “But 
withdrawing it would show a very serious lack 
of respect towards Chilean scientists.” 
Finance minister Alberto Arenas is 
expected to advise on the financial viability of 
the science ministry ina report on the state of 
the country’s finances on 7 April. = 


CORRECTION 

The News story ‘Global seismic network 
takes to the seas’ (Nature 507, 151; 2014) 
wrongly located USGS seismologist Cecily 
Wolfe at the University of Hawaii in Honolulu 
instead of at Reston, Virginia. 
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N eanderthal 
in the family 


By Ewen Callaway 


Thirty years after the study of ancient DNA began, 
it promises to upend our view of the past. 


« 
. al 
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efore ancient DNA exposed 
the sexual proclivities of 
Neanderthals or the ancestry 
of the first Americans, there 
was the quagga. 

An equine oddity with the 
head of a zebra and the rump ofa donkey, the 
last quagga (Equus quagga quagga) died in 
1883. A century later, researchers published’ 
around 200 nucleotides sequenced from a 
140-year-old piece of quagga muscle. Those 
scraps of DNA — the first genetic secrets 
pulled from a long-dead organism — revealed 
that the quagga was distinct from the moun- 
tain zebra (Equus zebra). 

More significantly, the research showed 
that from then on, examining fossils would no 
longer be the only way to probe extinct life. 
“If the long-term survival of DNA proves to 
bea general phenomenon,’ geneticists Russell 
Higuchi and Allan Wilson of the University 
of California, Berkeley, and their colleagues 
noted in their quagga paper’, “several fields 
including palaeontology, evolutionary biology, 
archaeology and forensic science may benefit.” 

At first, progress was fitful. Concerns over the 
authenticity of ancient-DNA research fuelled 
schisms in the field and deep scepticism out- 
side it. But this has faded, thanks to laboratory 
rigour that borders on paranoia and sequencing 
techniques that help researchers to identify and 
exclude contaminating modern DNA. 

These advances have fostered an ancient- 
genomics boom. In the past year, research- 
ers have unveiled the two oldest genomes on 
record: those ofa horse that had been buried in 
Canadian permafrost for around 700,000 years’, 
and of a roughly 400,000-year-old human rela- 
tive from a Spanish cavern’. A Neanderthal 
sequence every bit as complete and accurate 
as a contemporary human genome has been 
released’, as has the genome of a Siberian child 
connecting Native Americans to Europeans’. 

Enabling this rush are technological 
improvements in isolating, sequencing and 
interpreting the time-ravaged DNA strands 
in ancient remains such as bones, teeth and 
hair. Pioneers are obtaining DNA from ever 
older and more degraded remains, and glean- 
ing insight about long-dead humans and other 
creatures. And now ancient DNA is set to move 
from the clean-rooms of specialists to the labs 
of archaeologists, population geneticists and 
others. Thirty years after the quagga led the 
way, Nature looks to the field’s future. 


A MILLION-YEAR-OLD GENOME 

Ludovic Orlando, an evolutionary biologist at 
the University of Copenhagen, had low expec- 
tations when he started sequencing DNA from 
a 560,000-to-780,000-year-old horse leg bone. 
His colleague, Eske Willerslev, had discovered 
the bone buried in the permafrost of the Cana- 
dian Yukon in 2003. Then he had chucked it into 
a freezer, waiting for technological improve- 
ments that would allow the bone’s degraded 
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DNA to be read. (Freezers in ancient-DNA labs 
brim with such ‘wait and see’ samples.) 

On a Sunday evening in 2010, Willerslev 
called Orlando to say that the time had come. 
Orlando was unconvinced: “I started the pro- 
ject with the firm intention of proving that it 
was not possible,” he says. 

Sequencing ancient DNA is a battle against 
time. After an organism dies, the long strands 
of its DNA fissure into ever shorter pieces, 
helped along by DNA-munching enzymes. 
Low temperatures slow this process, but even- 
tually the strands become so short that they 
contain little information. 

To read the horse's genome, Orlando needed 
to shepherd useful DNA fragments through 
the harsh enzymatic treatments used to extract 
them and ready them for sequencing. Orlando 
and his team found that the preparation lost 
vast quantities of fragments. But with a few 
tweaks to the experimental protocol, such 
as reducing the extraction temperature, the 
researchers captured ten times more scraps of 
DNA than before — and produced a draft of 
the oldest genome on record’. 

Using a similar approach, Svante Paabo, a 
geneticist at the Max Planck Institute for Evolu- 
tionary Anthropology in Leipzig, Germany, and 
his team turned their attention to 400,000-year- 
old remains from the Sima de los Huesos cav- 
ern in northern Spain, which may have been a 
burial pit for recent relatives of modern humans 
called hominins (see ‘Hidden heritage’). In the 
pit, the bones remained at stable, low tempera- 
tures, slowing the breakdown of DNA. “If you 
could have told the hominins where to leave 
their bones, you may have chosen that site,” says 
Matthias Meyer, a molecular biologist at Paabo's 
institute who is leading the efforts. 

Last December, the team reported’ roughly 


HIDDEN HERITAGE 


Homo antecessor 
may be related toa 
ghost population 


that bred with 
archaic humans 
called Denisovans. 


1 million years ago 800,000 


16,300 letters of a Sima de los Huesos individ- 
ual’s mitochondrial genome — the DNA from 
power-generating structures in its cells. The 
sequence revealed an unexpected relationship 
between the Sima de los Huesos remains and 
the Denisovans, an archaic group of humans 
that Paabo’s team had discovered in Russia’s 
Altai Mountains thousands of kilometres away. 
Meyer and his colleagues hope to improve 
their methods enough to obtain some or all 
of the Sima de los Huesos individual's nuclear 
genome, the DNA from the nuclei of its cells. 
“Tt must be possible,’ says Meyer. “I won't rest 
until this has been done.” 

It is now a matter of when, not if, someone 
will produce a genome from an Arctic animal 
buried in permafrost for longer than 1 mil- 
lion years, says Meyer. But he and Paabo want 
to push the limits of ancient DNA in hominin 
specimens from warmer locales, such as fos- 
sils of Homo erectus, the common ancestor 
of humans and Neanderthals, found in Asia. 
And Orlando says that researchers may have 
luck using new extraction techniques on previ- 
ously vexing remains such as Egyptian mum- 
mies or Homo floresiensis, a small hominin at 
least 18,000 years old that was found in a cave 
on the Indonesian island of Flores. “It opens a 
great number of places where there are lots of 
important stories going on, such as the Middle 
East or the tropics,” he says. 


GHOSTS IN THE CODE 

A few years ago, David Reich discovered a 
ghost. Reich, a population geneticist at Har- 
vard Medical School in Boston, Massachusetts, 
and his team were reconstructing the history of 
Europe using genomes from modern people, 
when they found a connection between north- 
ern Europeans and Native Americans. They 


Researchers hope to 
push the limits of 
technology to tackle 
Homo erectus fossils 
found in relatively 


warm climates. of kilometres away. 
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Fossils in Spain show an 
unexpected kinship with 
Denisovans found thousands 
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proposed that a now-extinct population in 
northern Eurasia had interbred with both the 
ancestors of Europeans and a Siberian group 
that later migrated to the Americas®. Reich 
calls such groups ghost populations, because 
they are identified by the echoes that they leave 
in genomes — not by bones or ancient DNA. 

Ghost populations are the product of statisti- 
cal models, and as such should be handled with 
care when genetic data from fossils are lacking, 
says Carlos Bustamante, a population geneticist 
at Stanford University in California. “When are 
we reifying something that’s a statistical arte- 
fact, versus when are we understanding some- 
thing that's a true biological event?” 

Sometimes these statistical spectres get a 
body. Last year, Willerslev’s team reported” 
the genome from 24,000-year-old remains 
dubbed the Mal’ta boy. The results showed 
that the boy, who had been found in central 
Siberia, came from a population related to 
both modern Native Americans and modern 
Europeans, matching Reich’s prediction (see 
Nature http://doi.org/12b; 2013). “It’s a spec- 
tacular find? he says. 

Ghost populations also lurk in ancient DNA. 
While analysing high-quality genomes of a 
Neanderthal and a Denisovan, a team led by 
Reich and Montgomery Slatkin at the Univer- 
sity of California, Berkeley, noticed a peculiar 
pattern: present-day sub-Saharan Africans 
are more closely related to Neanderthals than 
they are to Denisovans*. But evidence from 
other ancient genomes suggested that the two 
archaic groups were equally related to present- 
day Africans. After weighing the possibilities, 
the scientists realized that they might have 
uncovered another ghost population. 

The puzzle could be solved, they theorized, 
if Denisovans had interbred with a species that 


The study of ancient DNA is revealing connections between archaic 
humans — and the traces they left behind in modern genomes. 
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had left Africa perhaps more than 1 million 
years ago and branched off from the common 
ancestor of humans, Neanderthals and Den- 
isovans. Subsequent Denisovans would have 
inherited DNA sequences that present-day 
Africans lack, explaining why Neanderthals 
seem to be closer kin to Africans. 

Reich’s team is analysing genetic 
signatures in humans with Denisovan 
DNA to establish when the Denisovans 
mated with this mystery population 
— information that could narrow the 
range of fossils to which it might belong. 
Genomes studied by Paabo’s lab, prin- 
cipally the Sima de los Huesos remains, 
may also reveal clues. 

Reich is not the only one conjur- 
ing ghosts. Chris Stringer, a palaeo- 
anthropologist at the Natural History 
Museum in London, has proposed that 
the 900,000-year-old hominin Homo 
antecessor, known from fossils found 
near Sima de los Huesos, could be part of the 
ghost population. If it had interbred with an 
ancestor of the Denisovans and the Sima de los 
Huesos hominins, it could explain the relation- 
ship between the two groups of remains. Test- 
ing that hypothesis would require the elusive 
Sima de los Huesos nuclear DNA. But Reich is 
optimistic that Paabo and his team will pull it 
off. “They've done miracles before in that lab 
and they may succeed again.” 


THE NEANDERTHAL WITHIN 

“We dont need bones necessarily to find ancient 
DNA,’ says Josh Akey, a population geneticist 
at the University of Washington in Seattle. “We 
can find the remnants of ancient DNA floating 
around in contemporary populations.” 

If early human populations bred with Nean- 
derthals and Denisovans, their descendants 
should carry short segments of archaic-human 
DNA. Researchers such as Akey are beginning 
to catalogue these segments to learn about the 
biology of archaic humans. Unlike the hunt 
for ghost populations, which relies on statis- 
tical population models, this approach allows 
researchers to identify specific regions of the 
genome acquired by interbreeding. 

In January, independent teams led by Akey’ 
and Reich® pieced together a substantial por- 
tion — about 20% and 40% respectively — of 
the Neanderthal genome from bits lurking in 
the genomes of hundreds of living humans. 
Their research indicated that some Europeans 
and Asians had gained genes involved in skin 
and hair from Neanderthals, possibly helping 
their ancestors to adapt to cold climates by pro- 
viding thicker skin, more hair and fewer pores 
(see Nature http://doi.org/rz9; 2014). But giant 
swathes of the modern genomes were devoid 
of Neanderthal ancestry, hinting that many 
Neanderthal genes might have been harmful 
in modern humans. Akey’s team identified’ one 
such region around the gene FOXP2, which is 
involved in speech and language. “It’s extremely 


compelling evidence that there were fitness 
costs to interbreeding,” he says. 

These discoveries are only the beginning. The 
Akey and Reich teams found that the genomes 
of east Asians possess, on average, slightly more 
Neanderthal DNA than do people of European 


With tools that 
make sequencing 


ancient DNA cheaper 


and easier, the field 
is becoming more 
egalitarian. 


ancestry. Akey sees this as possible evidence that 
Neanderthals interbred with ancient humans on 
at least two separate occasions: once with the 
ancestors of all Eurasians, and later with a popu- 
lation ancestral only to east Asians. And Akey 
believes that humans are likely to bear genetic 
scraps from other extinct species, includ- 
ing some that interbred with the ancestors of 
humans in sub-Saharan Africa. 


ANCIENT DNA FOR THE MASSES 

For much of the past 30 years, the sensitivity 
of the polymerase chain reaction (PCR), the 
method used to amplify ancient DNA, made 
it prone to contamination. The field’s leaders 
often greeted the work of outsiders with suspi- 
cion, earning some of them the title ‘the PCR 
police. And in recent years, palaeogenomics 
has been the domain of specialist labs such 
as Paadbo's, with the expertise and money to 
obtain and screen hundreds of fossils to find 
the few that yield enough DNA to sequence 
an entire genome. 

That is set to change. New procedures mean 
that researchers can now reliably obtain DNA 
from all but the most degraded samples, and 
then sequence only the portions of a genome 
that they are interested in. “I'm still surprised 
that there are so few labs in the world that do 
this,” says Johannes Krause, a palaeogeneticist 
at the University of Tibingen, Germany, who 
led much of the Denisovan work while in 
Paabos lab. “It’s not rocket science.’ 

Gradually, new researchers are entering the 
field. “If I can break in, then anyone can,’ jokes 
Bustamante. His research originally focused 
on ancestry in current human populations. 
Then, a fewyears ago, he gota phone call about 
a mummy. 

An international team had sequenced 
the genome of Otzi, a 5,300-year-old frozen 
corpse found in the Tyrolean Alps of Italy 
in 1991. The researchers wondered if Busta- 
mante could help them to make sense of the 
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ice-man’s ancestry. Together, they showed that 
Otzi was more closely related to humans who 
now live in Sardinia and Corsica than those in 
central Europe, evidence that the population of 
Europe when he was alive looked very different 
to how it does today”. 

Bustamante has since plunged into 
the world of ancient DNA. His team is 
sequencing samples that chart the arrival 
of farming in Bulgaria, the transatlan- 
tic slave trade in the Americas and dog 
domestication. The group is developing 
tools to make sequencing ancient DNA 
cheaper and easier. “We want to democ- 
ratize the field? says Bustamante. 

Reich, too, sees ancient DNA becom- 
ing more egalitarian. His lab’s growing 
interest in areas of human history such 
as the advent of agriculture or the his- 
tory of the Indian subcontinent has led 
it to analyse — often in bulk — remains 
less rarefied than the scarce Neanderthal 
samples that first lured him to the field. 

Last year, Reich was part of a team that 
reported”° an analysis of mitochondrial DNA 
from 364 European samples between 5,500 and 
1,550 years old, to identify major population 
shifts in Neolithic Europe. Ancient genom- 
ics is also set to solve long-standing questions 
about when and where humans domesticated 
animals such as dogs, cattle and chickens. A 
2013 study" of 18 mitochondrial genomes 
from ancient dogs and wolves, for instance, 
suggested that European hunter-gatherers 
domesticated wolves from a population that is 
now extinct. 

Researchers are also returning to the ques- 
tions that launched the field 30 years ago. 
Around the time that Orlando's team began 
sequencing the 700,000-year-old horse, it also 
turned its attention to a much younger sample 
— from the quagga. 

The effort to sequence the full quagga 
genome is part of large project to understand 
the evolutionary relationship between living 
and extinct horses, zebras and donkeys, and 
to identify the genetic basis for certain traits. 
“I was thinking it would be cool to do the 
oldest, but also the first — where ancient DNA 
started,” says Orlando. “It shows the progress 
the field has made?” m 


Ewen Callaway writes for Nature in London. 
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A tolerant approach 


Despite along record of failure, a few immunologists continue to 
pursue precisely targeted therapies for autoimmune diseases. 


BY KEN GARBER 


he has fretted over his meals, blood glucose levels and the daily 

programming of his insulin pump. Wiley, a statistician who lives 
outside Boulder, Colorado, and works on big data analytics, has learned 
to live in a state of hypervigilance. Finding the right dose of insulin 
turned out to be more art than science and, like many with the disease, 
his control began slipping away with time. By 2008, he says, “my insulin 
doses just basically didn’t work any more”. Unable to reliably anticipate 
what he needed, Wiley was having severe hypoglycaemic episodes and 
was at risk of diabetic seizures and long-term disability. 

On his endocrinologist’s advice, he enrolled in a clinical trial of a 
novel drug called BHT-3021. Although technically a vaccine, BHT-3021 
is not designed to stimulate an immune response, but rather to shut it 
down, stopping the body’s errant attack against cells in the pancreas that 
produce insulin. The goal is to achieve immune tolerance. 

Drugs that broadly suppress immunity are the standard treatment 
for autoimmune disorders such as multiple sclerosis (MS), rheuma- 
toid arthritis and lupus. But these drugs can lead to life-threatening 


RB since Ed Wiley learned that he had type 1 diabetes in 1997, 
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infections, and do not address the cause of the disease. Tolerance thera- 
pies are different. They aim to target only the immune cells that react to 
a specific antigen, a substance — in Wiley’s case, the insulin precursor 
proinsulin — that might trigger a response. “Why shut down a major 
arm of the immune system, if we're just trying to restore tolerance to one 
antigen?” asks immunologist Larry Steinman of Stanford University in 
California, who developed BHT-3021. 

This strategy, known as antigen-specific tolerance, is simple in con- 
cept. But, so far, dozens of clinical trials have failed to achieve a categori- 
cal success. And there is a fine line between calming the immune system 
and stimulating it, so these efforts risk making a disease worse — as 
happened in an MS trial some 15 years ago. 

BHT-3021 is one of a new wave of treatments conceived by five vet- 
erans of the field that promises to do better. Early-stage trials show 
encouraging results in people with MS and type 1 diabetes. “A number 
of these approaches really are going to work,” predicts David Wraith, 
an immunologist at the University of Bristol, UK, and one of the few 
persisting in pursuing the work. “The science has caught up.’ 


AUTOIMMUNE OVERRIDE 

The approaches are varied, but they all rely on the body’s natural abil- 
ity to distinguish its own substances from those of foreign intruders. 
When bacteria or viruses invade, some are swallowed by specialized 
antigen-presenting cells, or APCs. These chop up the bacterial or viral 
antigens and present them to T cells, white blood cells that orchestrate 
the immune response. The T cells then proliferate and launch a coor- 
dinated attack. 

APCs also ensure that normal daily maintenance does not turn 
deadly. As the body’s own cells continuously die and are replenished, 
APCs mop up the debris and present those self-antigens to T cells 
along with an array of proteins that signal that these cellular remnants 
pose no danger. In autoimmunity, for unknown reasons, this protec- 
tive mechanism goes awry. The new therapies are designed to override 
this dysfunction by deliberately sending the relevant antigen to tissues 
where the body is likely to see it as a non-threatening part of itself (see 
“Teaching tolerance’). 

Most of the therapies developed so far target MS, which occurs when 
the immune system attacks the myelin sheath that protects neurons 
in the brain and spinal cord. Immunologist Stephen Miller of North- 
western University in Chicago, Illinois, designed a therapy’ that he 
and neurologist Roland Martin, now at University Hospital Zurich in 
Switzerland, began testing in patients in 2009. During the treatment, 
the patients’ white blood cells are extracted, chemically linked to seven 
myelin antigens, then reinfused. The cells make their way to the spleen, 
where they die and release the antigen, which is picked up by APCs. 

Wraith’s drug’, ATX-MS-1467, uses four peptides, or pieces, of a 
myelin protein commonly attacked in MS. These injected antigens are 
taken up by immature APCs, which are incapable of stimulating T cells 
and instead inactivate them or convert them to a T-cell type that main- 
tains tolerance. Krzysztof Selmaj’s group at the Medical University of 
Lodz in Poland designed a similar therapy for the disease using three 
myelin peptides delivered through a patch that users wear on their skin’. 

Instead of protein fragments, Steinmans diabetes treatment consists of 
circular pieces of DNA carrying the proinsulin gene, injected into mus- 
cle*. The proinsulin protein is manufactured in the muscle cells, and is 
then secreted, taken up by APCs and presented to T cells. This produces 
“a signal that doesn't invoke danger but invokes tolerance’, says Steinman. 
So far, this treatment and the other new therapies have been tested in 
fewer than 150 patients, but industry onlookers say they show promise. 


DANGEROUS GROUND 

The treatments not only have to overcome strong autoimmunity, they 
must also avoid making it worse. “We need to be extremely cautious,” 
says Gerald Nepom, an immunologist at the Benaroya Research Insti- 
tute in Seattle, Washington. Any novel manipulation of the immune sys- 
tem involves some risk. In 2006, an antibody-based treatment developed 
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by the German drug company TeGenero was given to six healthy vol- 
unteers in a UK trial. Designed to quell the autoimmune response by a 
mechanism different from antigen-specific tolerance, it instead caused 
a massive immune response and multiple-organ failure’. The partici- 
pants survived, and investigators have since implemented safer dosing 
protocols, but researchers know that an immune response can quickly 
go wrong. “People ask me what keeps me up at night,” says Steinman. 
“Until we have quite a few patients under our belts for some time, it’s 


“We have to take small steps 
and see what happens.” 


the worry that we're going to make things worse.” 

Antigens can easily trigger immunity instead of tolerance, because 
there is a delicate balance between the two fates. Drug dose, delivery 
route, tissue destination and unpredictable changes in T-cell identity all 
matter. In a trial for an MS treatment that began in 1998 (ref. 6), doctors 
gave patients a single modified myelin peptide, but the trial was halted 
after three of the first eight recipients suffered worsening symptoms; one 
was left unable to walk. All three recovered with immunosuppressive 
treatment, but tests clearly implicated the experimental therapy, and the 
researchers ultimately worked out that they had extrapolated too high 
a dose from earlier in vitro studies. “We were both frustrated and also 
shocked,” says Martin, a lead investigator on the trial. 

Most trials so far have simply failed to work. In 2009, a trial of a 
myelin peptide antigen involving 612 people with MS showed no benefit 
over a placebo’. One likely reason is that the immune response in most 
autoimmune diseases can shift from one antigen to another as tissue 
damage progresses. Miller documented this phenomenon, known as 
epitope spreading, in animal models 20 years ago®. Martin points out 
that the failed 2009 trial used a single antigen, and adds that the patients 
involved were at a very advanced stage of the disease, when the immune 
system is no longer the main cause of neuronal damage. 

Wraith says that the field has learned from its mistakes. The new thera- 
pies all incorporate multiple antigens to anticipate epitope spreading, for 
example. And researchers are paying close attention to other relevant 
factors, such as how the drugs are administered. In the past, says Chris- 
tophe Benoist, an immunologist at Harvard Medical School in Boston, 
Massachusetts, it was often more of a gamble. The approach, he says, 
was, “Let's just put the antigen in and hope something good happens”. 

Richard Ransohoff, an MS researcher at the Cleveland Clinic in Ohio, 
has faith in the new therapies, which build on recent advances in under- 
standing antigens and T cells. “These are all very experienced people 
who understand the complexity of what they’re trying to do,” he says. 
That is not to say that the mechanisms of tolerance have been completely 
worked out. “We're working furiously,” says Steinman, but he admits that 
they are operating in “a vast ocean of ignorance in between icebergs of 
knowledge”. Waiting for that perfect understanding, however, seems 
foolish. “We have to take small steps and see what happens, he says. 

Results from the latest wave of trials are reassuring. Martin’s clinical 
team gave nine patients a single injection of the manipulated immune 
cells, in escalating doses. The treatment seems to be safe, and the four 
patients receiving the highest doses showed a reduction in the number 
of T cells targeting self-antigens’. “That was a very positive proof-of- 
concept study,” says Nepom, who was not involved in any of the trials. 

The most promising trial, he adds, was a 30-patient test of Selmaj’s 
therapy. Compared with a placebo, the treatment achieved a statisti- 
cally significant drop in MS disease activity, as measured by magnetic 
resonance imaging of the brain. Patients assigned to the therapy also 
had far fewer relapses’”. 

In Wraith’s ATX-MS- 1467 trial, sponsored by Apitope in Diepenbeek, 
Belgium, 43 patients received the treatment in a series of five escalating 
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Teaching 
tolerance 


Ny 


Sentinels called antigen-presenting 
cells (APCs) engulf and chop up 
biological molecules, and present 
them on specialized surface proteins. 


doses either under or into the skin. Those in the latter group showed a 
large reduction in MS activity, says Wraith, although this returned three 
months after the treatment ended. The data have not yet been published. 

Steinman’s 80-patient, placebo-controlled, BHT-3021 diabetes trial 
established that the drug was safe, and one dosage group showed a 
statistically significant increase in a marker of insulin secretion and a 
simultaneous decrease in the number of T cells targeting proinsulin”. 

Wiley believes the treatment has helped. During the 12-week study, 
he says, he regained control of his insulin dosing. And, subjectively at 
least, the positive effects have persisted. 


MOVING FORWARD 

But the legacy of failure has left many researchers — and drug compa- 
nies — deeply sceptical. Even after positive trial results, Martin says, 
there are “very few investigators that are masochistic enough over a 
long time to continue in this field, because it’s difficult to get funding”. 
He and Selmaj are trying to get support for their new MS trials. And 
Steinman and the company he co-founded, Tolerion of Portola Valley, 
California, are seeking between US$20 million and $30 million to take 
their diabetes vaccine to the next stage. “It’s impossible to say when this 
is going to happen,’ Steinman says. His first diabetes trial was spon- 
sored by the biotech giant Genentech, based in South San Francisco, 
California. But after acquiring the company in 2009, Swiss drug firm 
Roche sought to get out of type 1 diabetes therapies, Steinman says. The 
company returned the therapy’s licence even before the trial ended, and 
terminated all future commitments. 

Others have had more success. Another Swiss drug company, Merck 
Serono, has already moved Wraith’s therapy into phase I] to confirm its 
efficacy. Miller, meanwhile, is planning a phase I trial of a variation on 
his original approach that uses biodegradable nanoparticles instead of 
blood cells as antigen carriers’. And Chicago-based COUR pharma- 
ceuticals — co-founded by Miller — has signed partnerships with two 
drug companies for trials in people with diabetes and coeliac disease. 

The next round of trials, Nepom says, should incorporate detailed 
mechanistic studies “in order to learn whether we've chosen the right 
antigen, and whether we've chosen the right dose and route” in treated 
patients. This is because, in diseases such as MS and diabetes, research- 
ers can only guess which antigen triggers the immune response at any 
point in time. “That's an incredibly challenging problem,’ says Nepom. 

He heads the Immune Tolerance Network (ITN), a US-led inter- 
national consortium that allocates roughly $27 million a year from the 
National Institutes of Health for clinical trials and related studies. In his 
view, antigen therapies should be used in combination with drugs from 
a second major category of tolerance treatment. Unlike antigen-specific 
therapies, these treatments do not inactivate, convert or destroy the 
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To stop the immune system from attacking healthy tissues, as in autoimmune 
disease, researchers are exploring ways to induce tolerance, essentially 
achieving the opposite goal of a conventional vaccine. They focus on how 
immune cells called T cells are trained to recognize antigens — peptides and 
other molecules that the immune system can identify as a threat. 


AT cell has a receptor 
that binds to the 
presented antigen. 


In conventional vaccination, the antigen generally appears 
along with other ‘co-stimulatory’ molecules on the surface of 
the APC. This combination puts the T cells on the alert and 
helps immunize against anything bearing that antigen. 


Without co-stimulation — or in the presence of other inhibitory 
signals — the T cell dies, becomes inactivated or takes ona 
regulatory role, preventing other immune cells from attacking 
healthy tissue. This is the goal in antigen-specific tolerance. 


relatively small number of T cells that attack a given antigen. Instead, 
they are designed to tilt the balance of the body’s T-cell repertoire away 
from the subtypes that promote inflammation and towards those that 
maintain a state of tolerance, without impairing normal immunity 
against pathogens. These tolerance treatments have worked only tempo- 
rarily, if at all, in autoimmunity trials. Nepom thinks that combining the 
two approaches will produce longer-lasting results. The ITN will soon 
require that all of the antigen trials it supports be combination trials. 

“That's definitely the path to go,’ agrees Benoist. “Whether that will 
work or not, who knows. But at least that’s a more rational way of doing 
things.” 

But pushing combinations does not sit well with some in the field. 
Miller, for example, doesn't think that the second approach achieves true 
tolerance, and he fears that combining it with antigen therapies will con- 
volute the results. “If you want to test tolerance, it really has to be done 
as an individual entity,’ he says. Wraith thinks that adding the other 
class of treatment may even interfere with antigen-specific tolerance. 

But the combination approach should reduce risk, say Nepom and 
ITN founder Jeff Bluestone, an immunologist at the University of 
California, San Francisco. This is because shifting the overall system 
towards tolerance should help to blunt any unexpected reaction to 
antigens. 

Miller and Wraith say that their treatments have already proved 
safe in patients, and Wraith debated with Bluestone during a meet- 
ing in the Netherlands last October. Wraith calls Bluestone’s concerns 
unfounded. Bluestone’s reply: “I hope he’s right.” 

Wiley, for his part, had no problems while taking BHT-3021. The only 
side effect he noticed was the welcome disappearance of some painful 
plantar warts. He thinks that BHT-3021 stabilized his body’s ability to 
produce insulin, and would gladly take part in a longer study. “No ques- 
tion,’ he says. “I would jump at the opportunity” = 


Ken Garber is a science writer in Ann Arbor, Michigan. 
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QBism puts the scientist 
back into science 


A participatory view of science resolves quantum paradoxes and finds room 
in classical physics for ‘the Now’, says N. David Mermin. 


external world: particles, waves and 

fields; how they change in time; and 
how they give rise to the forms of matter, ter- 
restrial and extraterrestrial, microscopic and 
macroscopic. This world makes itself known 
to each of us through our own private internal 
perceptions. Yet physical science has ignored 
the ‘subject’ — the scientist — even though 
their subjective experience constitutes their 


Pec: science describes the objective 


only link with the external world. 

In Nature and the Greeks’, Austrian physi- 
cist Erwin Schrodinger traced the removal of 
the subject from science back more than two 
millennia. Alongside the spectacular success 
of physical science, this exclusion of personal 
experience has given rise to some vexing and 
persistent puzzles and paradoxes. 

Two such unrelated long-standing 
problems are both resolved by recognizing 
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that the perceiving subject has as important 
arole to play in understanding the nature of 
physical science as does the perceived object. 
The first problem is the notorious disa- 
greement, confusion and murkiness that for 
almost a century has plagued the foundations 
of quantum mechanics, in spite of the theory's 
extraordinary usefulness and power. The sec- 
ond, less famous, problem has been with us 
at least as long: there seems to be nothing P 
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> in physics that singles out ‘the present 
moment: Albert Einstein called this the prob- 
lem of ‘the Now. Both problems are symp- 
toms of the exclusion from physical science 
of the perceiving subject, and are solved by 
restoring what the ancient Greeks removed. 


QUANTUM MECHANICS 

Schrodinger wrote in a little-known 
1931 letter* to German physicist Arnold 
Sommerfeld that quantum mechanics 
“deals only with the object-subject relation” 
Another founder of quantum mechanics, 
Danish physicist Niels Bohr, insisted in a 1929 
essay" that the purpose of science was not to 
reveal “the real essence of the phenomena” 
but only to find “relations between the mani- 
fold aspects of our experience”. 

In spite of these early hints, it was only in 
the twenty-first century that US physicist 
Christopher Fuchs and British-German 
physicist Riidiger Schack*® put forth an 
understanding of quantum mechanics that 
restored the balance between subject and 
object. They call their new point of view 
‘QBism’: Q is for quantum and B is for Bayes- 
ian — a view of probability that includes an 
agent who makes bets and updates odds. 
QBism attributes the muddle at the founda- 
tions of quantum mechanics to our unac- 
knowledged removal of the scientist from 
the science. 

Much of this muddle is associated with 
the ‘wavefunction that quantum mechanics 
assigns to a physical system. This irritatingly 
uninformative term reveals the lack of clar- 
ity present in the field from its very begin- 
ning in 1925. People argue to this day about 
whether wavefunctions are real entities, like 
stones or ripples on a pond, or mathemati- 
cal abstractions that help us to organize our 
thinking, like the calculus of probabilities. 

Fuchs and Schack adopt the latter view. 
They take a wavefunction to be associated 
with a physical system by an agent — me, for 
example, based on my past experience. I use 
the wavefunction, following rules laid down 
by quantum mechanics, to calculate the likeli- 
hood of what I might experience next, should 
I choose to probe further. Depending on what 
I then perceive, I can update the wavefunction 
on the basis of that experience, allowing me 
to better assess my subsequent expectations. 

People who believe wavefunctions to be 
as real as stones have invested much effort in 
searching for objective physical mechanisms 
responsible for such changes in the wave- 
function: a novel manifestation of gravity, for 
example, or a new kind of fundamental all- 
pervasive friction. But according to QBism, 
the change is only in my personal expecta- 
tions, which I revise to accommodate my new 
experience. 

Another celebrated part of the muddle 
produced by the exclusion of the perceiving 
subject is ‘quantum non-locality; the belief of 
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some quantum physicists and many mystics, 
parapsychologists and journalists that an 
action in one region of space can instantly 
alter the real state of affairs in a faraway 
region. Thousands of papers have been writ- 
ten about this mysterious action at a distance 
over the past 50 years. A clue that the only 
change is in the expectations of the perceiv- 
ing subject’ is that to learn anything about 
such alterations one must consult somebody 

in the region where the action took place. 
Most physicists who have paid some atten- 
tion to QBism have rejected this explicit intro- 
duction of subjective 


“The Now is personal experience 
neither an into science, together 
illusion nor with its consequences 
a spurious for our understanding 
manifestation of quantum physics. 
of temporal It offends their sense 


that science is strictly 
objective. 

QBists are often charged with solipsism: a 
belief that the world exists only in the mind 
ofa single agent. This is wrong. Although I 
cannot enter your mind to experience your 
own private perceptions, you can affect my 
perceptions through language. When I con- 
verse with you or read your books and articles 
in Nature, I plausibly conclude that you are a 
perceiving being rather like myself, and infer 
features of your experience. This is how we 
can arrive at a common understanding of our 
external worlds, in spite of the privacy of our 
individual experiences. 


chauvinism.” 


THE NOW 

The QBist conversation can be broadened 
to include issues in which neither quantum 
mechanics nor probability plays a part, such 
as the problem of the Now, which arises in 
purely classical (pre-quantum) physics. I 
change the term to ‘CBism when describ- 
ing applications of the QBist view of science 
in such classical settings. Here C stands for 
classical and B, for Bohr, whose wisdom 
went beyond quantum mechanics when 
he taught that physical science studies our 
experience. 

Philosopher Rudolph Carnap* recalls 
that the problem of the Now worried Ein- 
stein seriously. Einstein told him that the 
experience of the present moment means 
something special for mankind, essentially 
different from the past and the future, and 
that physics cannot describe such a differ- 
ence. Carnap described Einstein as painfully 
resigned to the inability of science to grasp 
this experience. 

The issue for Einstein was not the famous 
revelation of relativity that whether or not 
two events in two different places happen at 
the same time can depend on your frame of 
reference. It was simply that physics seems 
to offer no way to identify the Now even at 
a single event in a single place, although a 
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local present moment — Now — is evident 
to each and every one of us as undeniably 
real. How can there be no place in physics 
for something as obvious as that? 

My Now — my current state of affairs — is 
a special event for me while it is happening. 
I can tell my Now from earlier events, which I 
only remember, and from later events which 
Ican only anticipate or imagine. The status of 
an event as my Nowis transitory: it becomes a 
memory as subsequent Nows emerge. 

Yet clear, evident and banal as this is to 
us all, there is no Now in the usual physi- 
cal description of space and time. Physicists 
represent all the events experienced by a 
single person as a line in four-dimensional 
space-time, called that person's ‘world-line. 
There is nothing about any point on my 
world-line that singles it out as my Now. 

When I recently mentioned to an eminent 
theoretical physicist that I was writing an 
essay explaining how the QBist view of 
science solves the strictly classical prob- 
lem of the Now, he said: “Ah, you're going 
to explain why we all have that illusion” 
Anda distinguished philosopher of science 
recently derided the attitude that there ought 
to be a Now on my world-line as “chauvin- 
ism of the present moment”. 

But the Now is neither an illusion nor a 
spurious manifestation of temporal chau- 
vinism. The problem of the Now is laid to 
rest by recognizing the mistake behind the 
conclusion that it is missing from our physi- 
cal description of the world. That is the very 
error that led us into the quantum muddle: 
the exclusion of personal experience from 
physical science. Einstein's pain at the inabil- 
ity of science to contain a Now was of a piece 
with his stubborn refusal to accept quantum 
mechanics as an adequate view of the world. 

Physicists reify space-time. They elevate 
it from a four-dimensional diagram used to 
record their experience into the kind of “real 
essence” that Bohr warned us not to seek. 
My space-time diagram lets me represent 
events from past experiences, along with 
deductions or conjectures about events that 
were not experienced or have yet to happen. 
By identifying my abstract diagram with an 
objective reality, I fool myself into regarding 
that diagram as a four-dimensional arena in 
which my life is lived. Actual experiences are 
spread out in time and in space, and actual 
clocks used to associate times with our expe- 
riences are extended physical objects. To rep- 
resent the rich spatio-temporal structure of 
human experience as mathematical points in 
a space-time continuum is a smart strategic 
simplification, but we ought not to confuse 
our actual experience with a cartoon. 

That there is a place for the present 
moment in physics becomes obvious when I 
take my experience of it as the reality it clearly 
is to me and recognize that space-time is an 
abstraction that I construct to organize such 


experiences. At any moment I can represent 
my past experience as my world-line, termi- 
nating in my Now. As it turns into a memory, 
Iexpand my diagram to contain my subse- 
quent Nows. The motion of my Now along 
my world-line reflects the fact that as my 
watch advances I acquire more experiences 
to record. 

This provides the place in physics for the 
Now of any one person. But could the prob- 
lem of the Now lie in relating the present 
moments of several different people? When 
you and I are communicating face-to-face 
I cannot imagine that a live encounter for 
me could be only a memory for you, or vice 
versa. When two people are together at an 
event, if the event is Now for one of them, 
then it is Now for both. Although this is only 
an inference for each person, I take it to be 
as fundamental a feature of two perceiving 
subjects as the Now is for a single subject. 

Our present moments must overlap at 
every one of our meetings — whenever we 
have a conversation, move apart and then 
come back together and have another con- 
versation. But throughout human history 
people have only moved at low speeds. The 
complicating effect of relativistic ‘time dila- 
tion’ — the slowing down of rapidly moving 
clocks — on the advances of our different 
individual Nows has been far too small to 
notice. We can, however, entertain the ques- 
tion of whether our present moments would 
coincide when we came back together, 


regardless of how rapidly we moved back 
and forth and regardless of how long the 
journey. 

It is a basic fact of relativity that my 
personal time — the progress of my present 
moment — keeps pace with the reading of 
my watch. If it did not, I would be aware 
that the rate of my watch had changed as it 
moved with me, in violation of Einstein’s 
(and Galileos) principle of relativity. This is 
all we need. Consider two twins. When they 
are together at home, their Nows coincide. 
Then Alice flies off to a nearby star at 80% of 
the speed of light, turns around and flies back 
home to Bob at the same speed. Relativity 
requires that if Bob’s watch has advanced ten 
years in the meantime, Alice’s has advanced 
only six. But because each of their present 
moments has advanced in step with the 
watch each is carrying, the moment of their 
reunion continues to be Now for them both. 

So it is incorrect to claim that physics has 
nothing to say about local Nows at single 
events. Physics predicts that our experi- 
ences of the Now will continue to have the 
same familiar features in a future world of 
interstellar travel at speeds near the speed 
of light, even for the distinct Nows of many 
different agents. 

Because it solves diverse conundrums 
in quantum mechanics as well as in the 
strictly classical problem of the Now, QBist 
(or CBist) thinking needs to be taken more 
seriously by physicists. It is time to consider 


what other foundational puzzles can be 
resolved by restoring the balance between 
subject and object in physical science. 

As another Viennese investigator even 
more famous than Schrédinger — Sigmund 
Freud — put it in 1927 (ref. 10): “The prob- 
lem of a world constitution that takes no 
account of the mental apparatus by which 
we perceive it is an empty abstraction.” = 


N. David Mermin is emeritus professor 
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State Physics, Cornell University, Ithaca, 
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Make mouse studies work 


More investment to characterize animal models can boost the ability of 
preclinical work to predict drug effects in humans, says Steve Perrin. 


ice take the blame for one of the 
most uncomfortable truths in 
translational research. Even 


after animal studies suggest that a treat- 
ment will be safe and effective, more 
than 80% of potential therapeutics 
fail when tested in people. Animal 
models of disease are frequently 
condemned as poor predictors of 
whether an experimental drug can 
become an effective treatment. Often, 
though, the real reason is that the pre- 
clinical experiments were not rigorously 
designed’”. 

The series of clinical trials for a poten- 
tial therapy can cost hundreds of millions 
of dollars. The human costs are even greater: 
patients with progressive terminal illnesses 
may have just one shot at an unproven but 
promising treatment. Clinical trials typically 
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require patients to commit to year or more 

of treatment, during which they are pre- 

cluded from pursuing other experimental 

options. Launching a clinical trial without 

the backing of robust animal data keeps 

patients out of tests for therapies that may 
have a better chance of success. 

One such group of patients is those 
with amyotrophic lateral sclerosis 
(ALS), the fatal neurodegenerative 
condition also known as Lou Gehrig’s 

or motor neuron disease. Over the 
past decade, about a dozen experimen- 
tal treatments have made their way into 
human trials for ALS. All had been shown 
to ameliorate disease in an established ani- 
mal model. All but one failed in the clinic, > 
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> and the survival benefits of that one are 
marginal. 

At the ALS Therapy Development Insti- 
tute (TDI) in Cambridge, Massachusetts, we 
have tested more than 100 potential drugs in 
an established mouse model of this disease 
(mostly unpublished work). Many of these 
drugs had been reported to slow down dis- 
ease in that same mouse model; none was 
found to be beneficial in our experiments 
(see ‘Due diligence, overdue’). Eight of 
these compounds ultimately failed in clini- 
cal trials, which together involved thousands 
of people. One needs to look no further than 
potential blockbuster indications such as 
Alzheimer’s and cancer to see that the prob- 
lem persists across diseases. 

After nearly a decade of validation work, 
the ALS TDI introduced guidelines that 
should reduce the number of false positives 
in preclinical studies and so prevent unwar- 
ranted clinical trials. The recommendations, 
which pertain to other diseases too, include: 
rigorously assessing animals’ physical and 
biochemical traits in terms of human disease; 
characterizing when disease symptoms and 
death occur and being alert to unexpected 
variation; and creating a mathematical model 
to aid experimental design, including how 
many mice must be included in a study. It is 
astonishing how often such straightforward 
steps are overlooked. It is hard to find a pub- 
lication, for example, in which a preclinical 
animal study is backed by statistical models 
to minimize experimental noise. 

The experiments necessary for this type 
of characterization are expensive, time- 
consuming and will not, in themselves, 
lead to new treatments. But without this 
upfront investment, financial resources for 
clinical trials are being wasted and lives are 
being lost. 


KNOW YOUR ANIMALS 

Investigations at the ALS TDI exemplify 
how initial physiological descriptions of an 
animal model rarely encompass all salient 
features, including how closely the model 
captures what is observed in patients. Such 
models are often inadequate for studying 


DUE DILIGENCE, OVERDUE 


Results of rigorous animal tests by the Amyotrophic Lateral Sclerosis Therapy Development Institute (ALS TDI) 
are less promising than those published. All these compounds have disappointed in human testing. 


Riluzole* : 


Creatine : 


Celebrex : 


Thalidomide | 


Ceftriaxone : 


Lithium : 


Minocycline : 
Sodium 
phenylbutyrate : 
Dexpramipexole : 
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Change in survival observed in mouse study (%) 


*Although riluzole is the only drug currently approved by the US Food and Drug Administration for ALS, our work showed no survival benefit. 
References for published studies can be found in supplementary information at go.nature.com/hf4jf6. 


how a drug affects various aspects of disease. 

ALS progression is characterized by a 
deterioration in the neurons that innervate 
skeletal muscles. Sequencing and genetic 
studies implicate RNA-binding proteins 
as crucial for maintaining the health of 
motor neurons’. Mouse models expressing 
a mutant form of the RNA binding protein 
TDP43 show hallmark features of ALS: loss 
of motor neurons, protein aggregation and 
progressive muscle atrophy’. 

But further study of these mice revealed 
key differences. In patients (and in estab- 
lished mouse models), paralysis progresses 
over time. However, we did not observe this 
progression in TDP43-mutant mice. Meas- 
urements of gait and grip strength showed 
that their muscle deficits were in fact mild, 
and post-mortem examination found that 
the animals died not of progressive muscle 
atrophy, but of acute bowel obstruction 
caused by deterioration of smooth muscles 
in the gut’. Although the existing TDP43- 
mutant mice may be useful for studying 
drugs’ effects on certain disease mecha- 
nisms, a drug’s ability to extend survival 
would most probably be irrelevant to people. 

Scientists who use animal models for 


FOUR WAYS TO FIGHT NOISE 


Simple steps to avoid spurious conclusions 


@ Exclude irrelevant animals As often done 
in clinical trials, subjects that die for reasons 
unrelated to disease (such as mishandling) 
should not be counted in results. Reasons 
for exclusion should be well documented. 


@ Balance for gender Males and females 
can show differences in symptoms that 
obscure modest drug effects. 


© Split littermates among experimental 
groups Putting siblings into the same 
treatment group can bias results. 


@ Track genes Genes that induce disease 
are often not inherited reliably. 

When copies are lost, symptoms 

can be less severe and drugs can seem 
more effective than they are. 
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translational research must proceed with 
caution, and be prepared to do further char- 
acterizations themselves. 


CANCEL THE NOISE 

ALS TDI scientists performed a meta- 
analysis on nearly 5,500 mice that had been 
used in treatment or control groups over 
four years'. All mice expressed a specific 
defective version of the SOD1 gene, which 
is mutated in about 10% of people with 
inherited ALS. This work, and that of oth- 
ers’, revealed both unexpected variation in 
the animals and ways to control for it. 

Almost 90% of the mice had an average 
lifespan of 134 days, give or take 10 days. 
Careful inspection of animals that lived 
shorter or longer revealed four factors that 
produced considerable noise in the data and 
could have led to spurious conclusions (see 
‘Four ways to fight noise’). Crucially, under- 
standing such variation requires careful 
monitoring of hundreds of mice over several 
generations. 

One factor is the failure to exclude animals 
whose deaths are unrelated to the disease 
being studied. Other factors are failing to 
split littermates between control and treat- 
ment groups, and not taking gender into 
account. Male SOD1 mice show symptoms 
as much as a week before females and die 
about a week earlier. Given that a week is a 
4% variability in survival, such differences 
could easily be misconstrued as a drug effect. 

The fourth factor regards the genes intro- 
duced to induce disease. All too often, a dis- 
ease phenotype is lost as a colony of breeding 
mice is built up. For many diseases, including 
ALS, animal models carry multiple copies of 
the disease-causing gene, and these repeated 
genes are often not passed on ina stable fash- 
ion as cells divide to make gametes. Regu- 
lar genotyping assays are essential to make 
sure that mice in subsequent generations do 


not have fewer copies of the transgene, and 
therefore less severe disease. 

At the ALS TDI we have seen this several 
times. When first described in 2010, all 
TDP43-mutant mice died within 200 days’. 
When we ordered mice from a breeding 
colony established from those used in this 
initial publication, the mice lived for up to 
400 days without showing signs of disease. 
To perform the characterization work on 
TDP43 described above, we first spent sev- 
eral months backcrossing the strain to create 
astable phenotype. 

Characterization can flag more subtle 
potential problems for translation. This is 
exemplified by a study showing that lithium 
can boost survival of SOD1 mice by 30 days, 
an astoundingly long time’. A small clini- 
cal trial showed that it also extended life in 
people with ALS*. Lithium is already sold to 
treat schizophrenia, and many people with 
ALS began taking the drug off label in hope 
of slowing down their disease progression. 
Three separate phase ITI clinical trials were 
launched in parallel to assess the drug’s 
effects. These enrolled hundreds of patients 
with a total cost of well over US$100 million. 
None of the three trials showed any thera- 
peutic benefit”. 

Concurrently, other groups attempted 
to reproduce the preclinical data and could 
not’””, Although it is difficult to determine 
why the first study showed such a dramatic 
effect, its initial results are curious. The 
median survival time of untreated animals 
was 20 days shorter than that observed else- 
where, suggesting other anomalies. 


For studies that aim to predict treatment 
benefits, such as extended survival or a delay 
of symptom progression, a mathematical sim- 
ulation is in order. This incorporates the vari- 
ation typically observed in an animal model 
to calculate how many animals should be 
assigned to the experimental groups. Accord- 
ing to our calculations, highly variable animal 
models could require hundreds of animals 
per group; even homogeneous ones require 
as many as ten. 

And before assessing a drug’s efficacy, 
researchers should investigate what dose 

animals can toler- 


“Public and ate, whether the 
privateagencies drug reaches the 
should fund relevant tissue at 
characterization the required dose 
studies as and how quickly the 
a specific drug is metabolized 
project.” or degraded by the 


body. We estimate 
that it takes about $30,000 and 6-9 months 
to characterize the toxicity ofa molecule and 
assess whether enough reaches the relevant 
tissue and has a sufficient halflife at the tar- 
get to be potentially effective. 

If those results are promising, then experi- 
ments to test whether a drug can extend 
an animal’s survival are warranted — this 
will cost about $100,000 per dose and take 
around 12 months. At least three doses of 
the molecule should be tested; this will help 
to establish that any drug responses are real 
and suggest what a reasonable dosing level 
might be. 

Thus, even assuming the model has been 
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adequately characterized, an investment 
of $330,000 is necessary just to determine 
whether a single drug has reasonable poten- 
tial to treat disease in humans. This seems 
worthwhile given that it could take thou- 
sands of patients, several years and hundreds 
of millions of dollars to move a drug through 
the clinical development process. 


COMMUNITY EFFORT 
As academic labs shift their focus to trans- 
lational research, the burden to characterize 
animal models will fall on them. Although 
the costs are meagre compared with those 
of clinical trials, the investment required in 
time and funds is far beyond what any one lab 
should be expected to do. This burden and 
the resulting mouse models should be shared. 
At the very least, researchers should place 
new animal models in a public repository so 
that other teams can repeat the characteriza- 
tion, and share the costs of doing it well. 
Public and private agencies should fund 
characterization studies as a specific project. 
A good example is the Alzheimer’s Disease 
Neuroimaging Initiative, a large, collabo- 
rative study to find diagnostic biomarkers 
of the disease. Competitive bidding and 
milestone-driven payments could persuade 
qualified groups to perform the necessary 
experiments and to make results publicly 
available. This is unglamorous work that 
will never directly lead to a breakthrough or 
therapy, and is hard to mesh with the aims of 
a typical grant proposal or graduate student 
training programme. However, without 
these investments, more patients and funds 
will be squandered on clinical trials that are 
uninformative and disappointing. m 


Steve Perrin is chief scientific officer at 
the ALS Therapy Development Institute in 
Cambridge, Massachusetts, USA. 
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J. D. Bernal (third from left) helped to define the ‘science of science’ as a discipline. 


The Social Function 


of Science 


Roger Pielke Jr assesses the legacy of J. D. Bernal’s 
science-policy classic on its 75th anniversary. 


Senator for Oklahoma, issued a report 

focused on helping the US National 
Science Foundation to better conduct 
research that “can transform and improve 
our lives, advance our understanding of the 
world, and create meaningful new jobs” It 
is ironic that this conservative Republican’s 
demands that research be carefully planned 
and focused on social objectives can be traced 
directly to the writings of an Irish-born 
communist crystallographer 75 years ago. 

Such is the wholesale acceptance of John 
Desmond Bernal’s views in his 1939 treatise 
The Social Function of Science — covering 
the organization of research to science and 
its social role — that they are now part of the 
fabric of science-policy debates across the 
political spectrum. For Bernal, usefulness 
was more than an aspiration: it was the central 
objective of the scientific enterprise and the 
desired end of state support of science. 

He was among the first to recognize that 
all public engagement is ultimately political, 
although his vision of scientists as stalwarts 
resisting partisan politics might now seem 


IE 2011 Tom Coburn, the Republican 


naive: “The scientist ... sees the social, 
economic and political situation as a prob- 
lem to which a solution must first be found 
and then applied, not as a battleground of 
personalities, careers and vested interests.” 

Bernal was the first to compile estimates 
of government-wide spending on science, 
several years before the first gauges of gross 
domestic product in the early 1940s. On the 
basis of such estimates, he concluded presci- 
ently that the United States was poised to take 
along-term leadership role in science. Today, 
much discussion of science policy (some 
would say too much) hinges on this kind of 
number-crunching; 75 years ago, it provided 
a fundamentally new lens through which to 
view the scientific enterprise. 

Bernal was born in Ireland in 1901. His 
formative years were marked by the First 
World War and the 1917 Russian revolution, 
which, along with the Great Depression in 
the 1930s, hada lasting 


tk The Social Function 
negative influence on 


<or ae of Science 
his view of capitalism. jp. BERNAL 
After earning adegree — George Routledge and 
in mathematics and Sons; 1939 
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BOOKS & ARTS 


science at the University of Cambridge, UK, 
in 1922, Bernal did his postgraduate training 
in X-ray crystallography before joining the 
Cambridge faculty in 1927. He became part 
of Britain’s left-wing intellectual elite, joining 
zoologist Solly Zuckerman’ dining club Tots 
and Quots (a reference to Roman playwright 
Terence’s “Quot homines, tot sententiae’, 
meaning ‘so many men, so many opinions’) 
with biologists Julian Huxley and J. B. S. Hal- 
dane, among others. Zuckerman became 
Britain’s first chief scientific adviser in 1964. 

Bernal started to write The Social Function 
of Science in 1938 after having “achieved a 
certain standing in the scientific community’, 
according to his biographer Andrew Brown 
(The Sage of Science; Oxford University Press, 
2007). He was far from the first to explore 
the nexus of science and society. For instance, 
the theme of the 1936 meeting of the British 
Science Association was ‘Science and Social 
Welfare’, and in 1937 the American Associa- 
tion for the Advancement of Science added 
“an examination of the profound effects of 
science upon society” as one of its objectives. 
Even so, Bernal’s book helped to define a new 
discipline: the science of science. 

The book was controversial for two rea- 
sons. First, Bernal was presenting a view of 
science that was directly at odds with the 
‘pure science ideal, in which scientists were 
expected to keep their distance from public 
affairs. Second was Bernal’ vision of science 
fulfilling its social function by supporting a 
centrally planned society. He even stated that 
“science is communism” and argued that the 
Soviet Union “was one State where the proper 
function of science was being realized”. 

In the debates that raged in scientific and 
other circles, which pitted scientific planning 
against freedom — essentially, socialist versus 
capitalist stances — Bernal’s arguments were 
often conflated with his support for the Soviet 
Union, frequently with his encouragement. 
A 1946 Nature Editorial sought, and largely 
failed, to find a middle ground: “for though 
we claim that the impact of science on society 
is now achieving such importance as to com- 
mand constant study which is bound to result 
in conscious planning, we are equally as con- 
vinced that it is the man of science who must 
be allowed to do the planning in consultation 
with others. In this way his freedom need not 
be impaired” (Nature 158, 565-567; 1946). 

Bernal’s great intellectual adversary was 
Michael Polanyi, a Hungarian chemist who 
was opposed to Soviet ideals. Polanyi's classic 
1962 journal article “The Republic of Science: 
Its Political and Economic Theory’ (Minerva 
1, 54-73; 1962) posits that individual scien- 
tists pursuing truth led to the most efficient 
social outcomes. The parallels with Adam 
Smith’s “invisible hand” guiding capitalist 
economies could not have been accidental. 

The decades-long debate between Bernal 
and Polanyi played out in many contexts. 
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> For instance, in 1943, when a more- 
organized approach to government- 
supported science was first being mooted 
in the United States, a Democratic Sena- 
tor for West Virginia, Harvey Kilgore, pro- 
posed creating an office to “mobilize the 
scientific and technical resources of the 
nation” A correspondence in The New 
York Times warned that this suggestion 
for government-directed science repre- 
sented “the latent germs of a form of com- 
munism that is most foreign to the ideals 
which our peoples are fighting to support”. 
After the Second World War, engineer 
Vannevar Bush, who oversaw the seminal 
1945 report Science —The Endless Frontier 
on US science policies, echoed Polanyi’s 
thinking, arguing for public resources 
with little public accountability. 

In the late 1940s, Bernal’s career began 
to fade with his endorsement of the theory 
of agricultural genetics propounded by 
Russian agronomist Trofim Lysenko. This 
was elevated to Soviet policy, yet quickly 
discredited by scientists in the West. In 
1948, Bernal debated against Polanyi on 
BBC radio, with Polanyi pointing to the 
suppression of science under Lysenko- 
ism as evidence of the shortfalls of the 
state planning of science. The BBC’s sub- 
sequent investigation revealed that nine 
Soviet geneticists had been put to death 
for expressing alternative views. Bernal’s 
continuing defence of the theory, and his 
glowing 1953 obituary of Joseph Stalin in 
the magazine Modern Quarterly, contrib- 
uted to his diminishing relevance. 

Although Bernal lost the intellectual 
battle over cold-war politics, his ideas 
on the social function of science have 
triumphed on nearly every count. The 
larger and more significant effect of 
The Social Function of Science has been 
to anticipate and help the ideal of ‘pure 
science’ to reach mythical status, usher- 
ing in an era of science focused on soci- 
etal needs, today characterized as ‘grand 
challenges’ by scientists and politicians. 

Bernal, looking back at the book 
25 years after its publication, emphasized 
that such needs implied more systematic 
thinking about science itself: “We need a 
strategy for research which must be based 
onascience of science.” Today, whether it 
isa US senator, a British prime minister or 
a Chinese president expressing a view that 
science must serve society, each is reflect- 
ing Bernal’ big idea. Seventy-five years 
on, that global influence on science-pol- 
icy thinking is his classic’s great legacy. m 


Roger Pielke Jr is director of the Center 
for Science and Technology Policy Research 
and professor of environmental studies at 
the University of Colorado Boulder. 
e-mail: pielke@colorado.edu 
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MICROBIOLOGY 


Majority rule 


Mark O. Martin relishes a stimulating tour of ‘little 


lives’, from fungi to bacteria. 


r | Ahe vast populations of the 
microscopic — warring, 
cooperating, dispersing 

and communicating — have 
a dizzying array of strate- 
gies and forms. These 
denizens of the universe 
at the far end of a micro- 
scope have much to 
teach humanity, their 
roles ranging from the 
digestion of food to 
the cycling of carbon 
and nitrogen in Earth's 
atmosphere. Several 
books have champi- 
oned matters microbial 
in recent years. The latest, 
biologist Nicholas Money’s 
The Amoeba in the Room, is 
filled with an impassioned fasci- 
nation for microscopic life around 
and within us, in both the prokaryotic 
and eukaryotic domains. Money recog- 
nizes that animals and plants are an “evolu- 
tionary afterthought” — as he writes, “the 
least part of life”. 

My interests tend toward the bacterial, 
viral and archaeal. Money covers this terri- 
tory thoroughly, and goes beyond to sing the 
praises of fungi, algae and protists. Among 
others, he evocatively describes the water 
mould Haptoglossa, which uses microbial 
artillery to attack and consume nematode 
worms; Cryptomonas, a “Russian nesting 
doll” of an alga, evolved from four different 
organisms; and Polychaos dubium, a huge 
amoeba with perhaps the largest amount 
of DNA to be found in a nucleus. Overall, 
Money delivers a heady mixture of history, 
philosophy, art and even poetry: the chap- 
ters are prefaced with 
lines from John Mil- 
ton’s Paradise Lost. 

Money begins with 
a ‘macro’ view of his 
garden pond, then 
dives into intricate 
details of the microbial 
populations in and j 
around it. The seem- 
ingly uninhabited 
water swarms with 


The Amoeba in the 
Room: Lives of the 


ws Microbes 
communities as CoM- —_\ICHOLAS P. MONEY 
plex as any seen with — Oxford University 
the naked eye, anda Press: 2014. 


© 2014 Macmillan Publishers Limited. All rights reserved 


Colonies of Volvox aureus algae, pregnant with 
daughter colonies. 


tree branch is revealed as home to popula- 
tions of interacting microbes. Money’s point 
is that the diversity of life is clear enough on 
the macroscopic scale, but the organismal 
diversity of the microbial world is stagger- 
ing. I particularly appreciated reading about 
the eight supergroups of microbial eukary- 
otes, including some familiar “animalcules” 
(such as the mitochondria-free pathogen 
Giardia, found among the Excavata). 

A chapter on microscopy takes the reader 
from Assyrian craftsmen to familiar names 
such as seventeenth-century microbiological 
pioneers Robert Hooke and Antoni van 
Leeuwenhoek, as well as characters with 
whom I was not familiar. Bénédict Prévost, 
for instance, presented evidence that 
microbes can cause disease 50 years before 
Louis Pasteur, whereas Henry Baker popu- 
larized the use of microscopes to observe tiny 
wonders in 1742. We meet Ed Ricketts, US 
marine biologist and co-author of Between 
Pacific Tides (1939), as well as sculpture-like 
coccolithophores, numerous Prochlorococcus 
and ubiquitous marine bacteriophages. As in 
the rest of the book, Money enthusiastically 
presents evidence of diversity everywhere, no 
matter the magnification. 


LEBENDKULTUREN.DE/SHUTTERSTOCK 


Money’s tour of the air 
and its inhabitants reveals 
an intriguing experiment 
conducted by US aviator 

Charles Lindbergh, famous 

for making the first solo non- 

stop transatlantic flight in 1927. 

In the 1930s, the US Department 

of Agriculture asked Lindbergh and his 

wife, Anne, to gather air samples in a bid to 
understand the dispersal of cereal rusts. The 
two collected abundant rust spores using a 
plane fitted with a sky-hook, a device incor- 
porating oiled microscope slides. Money 
also discusses how more recent research- 
ers have speculated that some bacteria and 
microscopic algae control cloud formation 
and rainfall. 

The co-evolution of human and microbe 
is amply explored. Those that cause disease 
or death — think pathogenic bacteria, flu 
viruses, malaria — have shaped our his- 
tory. But, as researchers have discovered 
in the past century, the human body could 
not function without the estimated thou- 
sands of species of bacteria that inhabit our 
mouths, guts and other recesses. Like the 
pond and branch that Money probes early 
on, a human can be seen as a collection 
of ecological niches, albeit a mobile one. 
Money goes on 


“Seemingly to discuss seem- 
uninhabited ingly inhospita- 
water ble environments 
swarms with (such as deep-sea 
communities hydrothermal 
and complex as vents, acidic areas 
any seen with or regions bom- 
the naked eye.” barded with high 


levels of radiation) 
in which microbes — suchas archaeans that 
can grow at 121 °C — prosper. He sums up 
with distressing prospects for extinctions 
and change, but also reassures: the micro- 
bial world is one of prodigious diversity, 
power and resilience. 

This is a lucid and informative book. 
There is an impressive afterword of refer- 
ences and notes, and fine line drawings. So 
much that is lyrical and little-known waits 
to be discovered here — novelties that will 
appeal to new undergraduates as well as 
to incorrigible microbial enthusiasts like 
myself. m 


Mark O. Martin is associate professor of 
biology at the University of Puget Sound in 
Tacoma, Washington. 

e-mail: momartin@pugetsound.edu 
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Books in brief 


Aluminum Dreams: The Making of Light Modernity 

Mimi Sheller MIT PRESs (2014) 

It propelled humanity into air and space, transformed 
communications and fed war machines. Aluminium — in aeroplanes, 
space capsules, satellites, bombs and baking foil — is welded into 
the built world. Mimi Sheller’s coruscating cultural study reveals how 
young US chemist Charles Martin Hall and his French counterpart 
Paul Héroult simultaneously discovered the electrolytic production of 
aluminium in 1886; how designers were galvanized by its potential 
for the light and sleek; and how social and environmental problems 
from bauxite mining and aluminium smelting persist. 


Plato at the Googleplex: Why Philosophy Won’t Go Away 

Rebecca Goldstein PANTHEON (2014) 

Philosopher Rebecca Goldstein probes why Plato — and 

the philosophical enterprise itself — remains a force in science and 
culture 2,400 years on. Into a weighty discussion of the Platonic 
world view Goldstein inserts fictional interludes that see Plato, 
Chromebook in hand, touring the Googleplex, a neuroscience lab 
and beyond. Although contrived, this thought experiment usefully 
casts an eye on our turbocharged century. And it shows what 
survives of this classical titan: an ability to plumb the deep questions 
we still grapple with, from the nature of knowledge to morality. 


The Age of Radiance: The Epic Rise and Dramatic Fall of the 
Atomic Era 

Craig Nelson SCRIBNER (2014) 

Tomes on things atomic are rife — from discoveries by seminal 
physicists to meltdowns and crumbling nuclear arsenals. Craig Nelson 
braids the strands into a comprehensive chronicle that is also a wry 
elegy for an ebbing age. Starting with Wilhelm Rontgen’s discovery of 
X-rays, Nelson gives us the panoply of physicists from Lise Meitner to 
Leo Szilard; offers a masterful reading of Hiroshima and Nagasaki; and 
takes us through weapons tests, the cold war, Fukushima, and the rest 
of our long attempt to “live with blessed curses”. 


The Thing with Feathers: The Surprising Lives of Birds and What 
They Reveal About Being Human 

Noah Strycker RIVERHEAD (2014) 

Birds intrigue humanity, and in this research round-up Noah 
Strycker reveals why — in marvels such as the equal-radius flight 
paths of flocking starlings and the decontamination chamber that 
is a vulture’s stomach. As he notes, such findings can mirror human 
realities. Clark’s nutcrackers, for instance, can recall the location 

of 5,000 seed caches. Yet, in captivity some wild species lose 
substantial hippocampal volume (and memory) within weeks — an 
echo of the rapid losses found in patients with Alzheimer’s disease. 


The Insect Cookbook: Food for a Sustainable Planet 

Arnold van Huis, Henk van Gurp and Marcel Dicke COLUMBIA UNIVERSITY 
PRESS (2014) 

The looming squeeze on food security is forcing some scientists 

to look at the abundance of protein under our feet: insects. Here 
entomologists Arnold van Huis and Marcel Dicke team up with 

chef Henk van Gurp for a pragmatic introduction to entomophagy, 
covering insect farming, nutrition and cuisine. Tarte tatin with 
chocolate-coated grasshoppers? With 2 billion of us already popping 
mealworms and more, this is a case of joining the crowd. BarbaraKiser 
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Before symbols, complex formulae were expressed in words. 


Numbers game 


George Szpiro enjoys a history of the slow but 
irresistible rise of mathematical symbols. 


olving an equation such as 2x-4=10 

is almost ludicrously easy. The symme- 

try is visible; we just perform identical 
manipulations on both sides of the equals 
sign until the symbol for the unknown value, 
x, remains isolated on the left. But it was not 
always that simple, as mathematician Joseph 
Mazur reveals in his history of maths nota- 
tion, Enlightening Symbols. 

Until the late Renaissance, there were few 
symbols to depict clearly and concisely what 
would later become known as an equation. 
Numbers and numerical problems were 
framed in words and sentences. Readers, 
mostly monks and scholars, had to form 
mental pictures of the problems at hand to 
solve, for example, quadratic equations. In 
the late nineteenth century, even economists 
still shunned equations and mathematical 
notation, preferring narrative and anecdotes. 
In his fascinating narrative, Mazur places the 
development of mathematical symbolism 
squarely within cultural evolution. 

In the first part of Enlightening Symbols, 
‘Numerals, Mazur traces the depiction of 
numbers, beginning more than five millennia 
ago. First, Sumerian farmers calculated the 
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areas of fields using cuneiform numbers; later, 
Egyptian priests used hieroglyphs for their 
calendars; and in medieval Europe, astrono- 
mers, merchants and officers of courts went 
about their daily business with abacuses. The 
chronicle leads from counting on ten fingers, 
through Egyptian, Hebrew, Greek, Chinese 
and Roman numerals, to the positional sys- 
tem that we use today, with units in the right- 
hand column, tens in 
the next column to the 
left, hundreds in the 
next, and so on. 

A revolution was 
fomented in the sev- 
enth century aD by 
Hindu scholars, who 
invented zero both as 
a number to denote 
nothingness and as 
a placeholder for the 
positional system. 
Dissemination of 
the system in Europe 
began with transla- 
tions into Arabic in the 
late eighth century, and 


Enlightening 
Symbols: A 
Short History of 
Mathematical 
Notation and Its 
Hidden Powers 


JOSEPH MAZUR 
Princeton University 
Press: 2014. 
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into Latin in the early twelfth century. Mazur 
discusses many notable early mathemati- 
cians, including the Indian Brahmagupta in 
the seventh century, the Persian Muhammad. 
ibn Musa Al-Khwarizmi in the ninth and the 
Italian Leonardo Pisano Bigollo — better 
known as Fibonacci — in the thirteenth. 

But numerals are only part of the story. 
There must also be signs to indicate negative 
and imaginary numbers, symbols to denote 
special constants, characters to represent 
unknowns and parameters, and pictograms 
for operators. Mazur’s second section, ‘Alge- 
bra, covers these. In ap 830, Al-Khwarizmi 
asked: “What must be the amount of two 
squares which, when summed up and added 
to ten times the root of one of them, make 
up asum of forty eight dirhems?” Compare 
such verbosity to our concise 2x*+ 10x=48, 
and then imagine the even more frightening 
impression left on readers by the Italian math- 
ematician Scipio del Ferro’s wordy prescrip- 
tion of how to solve cubic equations in 1505. 

About 50 years later, the Welsh physi- 
cian Robert Recorde introduced the equals 
sign, =, “to auoide the tediouse repetition of 
these woordes: is equalle to ... because noe. 
2. thynges, can be moare equalle” than a pair 
of parallels. Recorde was following a trend to 
symbolize algebra, first with word abbrevia- 
tions and later with specialized symbols, that 
had been started by mathematicians in the 
Maghreb about two centuries earlier. 

“The Power of Symbols; the book’s third 
part, is the most challenging. In it, Mazur 
turns to the theory of mind. Does one need 
language to have mathematical thought? 
Drawing on material from luminaries includ- 
ing writer Fyodor Dostoyevsky, philosopher 
Ludwig Wittgenstein, cognitive neuroscien- 
tist Stanislas Dehaene and psychologist Dan- 
iel Kahneman, he wrestles with the question. 

In literature, for example, meaning comes 
from associative experiences, and so it is at 
times with reading mathematics. Mazur 
argues that symbols, like poems, make 
connections between experience and the 
unknown to convey meaning. But when 
symbols acquire meaning, he warns, one 
may lose sight of the object represented, 
and continue with mechanical manipula- 
tions without grasping the essence of what 
is being done — as when one juggles equa- 
tions that contain 2 without comprehending 
the connection to a circle’s circumference. 

This is a nuanced, intelligently framed 
chronicle packed with nuggets — such as 
the fact that Hindus, not Arabs, introduced 
Arabic numerals. In a word: enlightening. m 


George Szpiro is a correspondent for 

the Swiss daily newspaper Neue Ziircher 
Zeitung. He wrote Pricing the Future, and 
is working on a history of mathematical 
economics and decision-making. 

e-mail: georgeszpiro@gmail.com 
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Swiss science set to 
stay international 


Curbs on immigration 

resulting from last month's 

Swiss referendum (see Nature 
506, 265; 2014) have led to 
Switzerland’s exclusion from 

the competition for European 
Research Council (ERC) grants. 
This is a devastating blow, given 
that the country has the highest 
share of international researchers 
on ERC grants. Thanks to 

the Swiss National Science 
Foundation (SNSF) stepping 

in to run a temporary parallel 
programme, however, Swiss 
science can remain international 
and competitive. 

The SNSF has supported 
basic science through single- 
investigator grants for more than 
60 years. The new temporary 
SNSF schemes will help 
researchers who are working in 
Switzerland or negotiating with 
Swiss institutions and who were 
planning to participate in the 
ERC competition. The schemes 
will have similar deadlines, 
procedures and success 
rates to the ERC’s, and the 
evaluation panels will comprise 
distinguished scientists from 
Switzerland and abroad. 

ERC grants represent a sort 
of ‘Champions League’ for 
researchers in Europe, and so 
it will not be easy to run a local 
programme of the same calibre. 
However, Switzerland will now 
be able to maintain its strong 
international track record: 

49% of our professorial bodies 
are international, 33% of our 
researchers are abroad, and 28% 
of our students are foreigners. 
Martin Vetterli SNSE Bern, 
Switzerland. 
martin.vetterli@snf.ch 


Publish results from 
volunteer computing 


In your discussion of waning 
participation in volunteer 
computing projects (Nature 
506, 16-17; 2014), you omit to 
mention the most important 


motivator of all — contribution 
to scientific progress. 

For instance, a review of 
completed projects on IBM’s 
World Community Grid for 
biomedical research uncovers 
a lacklustre track record in 
scientific output (see go.nature. 
com/fkheej). Four out of 16 
of these projects have yet to 
produce any publications, even 
though the earliest project was 
completed more than 5 years 
ago. Others have generated only 
non-peer-reviewed output, such 
as book chapters or conference 
proceedings. Of the peer- 
reviewed publications, several 
focused solely on the technical 
aspect of distributed computing, 
with no analysis of the computed 
results. 

To maintain the viability 
of volunteer computing 
projects, researchers should be 
reminded that public support 
and confidence are gained 
through professional and timely 
communication of results. 
Otherwise, online titles or 
badges amount to no more than 
virtual candy. 

Herman Tse University of Hong 
Kong, Hong Kong. 
htse@hku.hk 


Thwart fatal infant 
gut parasite 


Vaccination against 
cryptosporidiosis, a devastating 
disease for children in the 
developing world caused by 
a protozoan parasite, has met 
with limited success (P. Hotez 
Health Affairs 30, 1080-1087; 
2011). A better way to prevent 
this and other enteric diseases 
in such settings is to address 
shortcomings in hygiene. 
When funding is in short 
supply, more investment in basic 
research is not the only solution 
(see B. Striepen Nature 503, 
189-191; 2013). Increasing 
efforts to extend interventions 
that are known to work 
already are more likely to bring 
immediate, tangible and cost- 
effective benefits. 


The most significant gains will 
be achieved through education 
and the design, deployment and 
uptake of suitable sanitation and 
potable-water systems within the 
affected communities. 

Brendon King, Paul Monis 
Australian Water Quality Centre, 
Adelaide, Australia. 
brendon.king@sawater.com.au 


Steering plans for 
neglected diseases 


Mary Moran criticizes plans by 
the World Health Organization 
(WHO) for tackling neglected 
diseases in the developing world 
(Nature 506, 267; 2014). Her 
arguments are misleading and 
trivialize global efforts to fix 
the research and development 
(R&D) system that is failing 

to address the health needs of 
people in these countries. 

After 10 years’ effort by 
WHO member states, the WHO 
Consultative Expert Working 
Group on R&D Financing and 
Coordination has proposed a 
framework to tackle these RkD 
shortcomings. This aims to 
promote and coordinate needs- 
driven innovation, sustainable 
funding mechanisms and 
patient access to health 
technologies. 

Moran contends that 
the current drug pipeline 
for neglected diseases is 
“successful” In the past ten 
years, however, only 4% of all 
new drugs and vaccines and 
1% of all new chemical entities 
were for neglected diseases — 
and none was suitable for the 
17 neglected tropical diseases 
(B. Pedrique et al. Lancet Glob. 
Health 1, e371-e379; 2013). 
Increased funding and better 
coordination will help, but what 
is really needed is a greater 
global incentive for needs- 
driven R&D. 

To fulfil the working group’s 
criteria, the WHO pilot projects 
need better coordination, 
optimization of research and 
accelerated delivery of health 
tools. This will be achieved 


only by strengthening capacity, 
testing innovative or pooled 
financing, developing an 
open-knowledge approach, and 
unlinking R&D costs from the 
end price of products. 

Bernard Pécoul Drugs for 
Neglected Diseases initiative, 
Geneva, Switzerland. 
bpecoul@dndi.org 

Manica Balasegaram Médecins 
Sans Frontiéres, Geneva, 
Switzerland. 


Research problems 
in Portugal run deep 


As founders of the Portuguese 
National Association of 
Researchers in Science and 
Technology (ANICT), we 
believe that research problems 
in the country run deeper than 
just funding (see Nature 507, 
306; 2014). 

Evaluation procedures for 
recruitment, career progression 
and funding allocation are in need 
of fundamental reform. They 
must become more transparent 
and be based solely on merit. 

Selection for nationally 
funded PhD and postdoc 
scholarships, for example, is 
beset by conflicting interests and 
insufficient transparency (see 
www.anict.pt). The recent cuts 
in scholarship numbers have 
exacerbated these factors. 

Currently, merit has little 
influence on career stability or 
advancement. Many researchers, 
even team leaders with students 
and ongoing grants, are being 
driven to leave the country 
because of their non-renewable, 
short-term contracts. 

Instead of bickering and 
passing the buck, Portugal's 
politicians need to agree on 
long-term plans to foster and 
retain the country’s talented 
young researchers. 

Nuno Cerca University of Minho, 
Braga, Portugal. 
nunocerca@ceb.uminho.pt 

Joao Lopes University of Lisbon, 
Portugal. 

Miguel Jorge University of 
Strathclyde, Glasgow, UK. 
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Updating views of visual updating 


Our brains create a stable view of the world even though our eyes dart around. A study of how the brain might compensate 
for eye movements reveals an unexpected twist in the vision-stabilizing mechanism. SEE LETTER P.504 


JOHN A. ASSAD 


ision is the great illusionist. Although 
\ | scenes before our eyes seem vivid and 
detailed, the part of the eye dedicated 
to high-acuity vision, the fovea, can cover only 
a narrow Sliver of visual space — little more 
than the breadth of a thumbnail held at arm’s 
length. Vision seems so detailed because we 
constantly move our eyes to scan the high- 
acuity fovea across a scene. Saccades — quick, 
jerky movements of the eyes — occur several 
times per second, filling in the perceptual 
fogginess of the visual periphery. 

Given this perpetual motion of the eyes, how 
the visual world looks so stable is an enduring 
mystery. The retina responds whenever the 
visual scene ‘slips’ across its surface, and is thus 
unable to distinguish between the scene flash- 
ing in front of the eyes (for example, a speeding 
train at a railway crossing) or the eyes rushing 
across the scene. The brain, however, generates 
eye movements, and thus could perceptually 
compensate for saccades’. On page 504 of this 
issue, Zirnsak et al.’ offer insight into how this 
compensation could occur. 

In 1992, the neuroscientist Michael Goldberg 
and his colleagues suggested a potential neu- 
ronal mechanism for eye-movement compen- 
sation’. These researchers studied neurons in 
the macaque monkey’s lateral parietal cortex, a 
part of the brain that serves as a bridge between 
vision and eye movements. Parietal neurons, 
like other visual neurons, have a receptive 
field (RF), a circumscribed part of the visual 
field for which visual stimuli activate electri- 
cal responses (Fig. 1). These RFs are defined 
relative to the fixation position (the point in 
visual space where the fovea is focused when 
eyes are stationary), and are usually considered 
spatially fixed relative to that position, reflect- 
ing the convergent hard-wired inputs to the 
neurons that ultimately stem from the retina. 
Thus, if the fixation position is displaced by a 
saccade, the RF should ‘move’ as well, in lock- 
step with the fixation position. 

However, Goldberg and co-workers made 
the important discovery that parietal RFs are 
more fluid around the time of saccades: when 
monkeys made saccades to a point on a com- 
puter screen, even before the eyes moved, the 
neurons could be activated by a spot of light 


Figure 1 | Movement of receptive fields. a, Visual neurons have a receptive field (RF), a region of the 
visual field for which stimuli activate electrical responses. RFs are defined relative to the position fixated 
upon by the fovea, the tiny area of the eye that provides high-acuity vision. Here, the positions of three RFs 
are shown when the fovea is fixated at the red circle. b, After the fovea establishes fixation at a new point, 
the RFs move in lock-step with the new fixation position. Previous studies suggested that RFs shift to their 
new positions before the eyes move, a process known as predictive remapping. c, Zirnsak et al.” report that 
the RFs transiently shift to the upcoming fixation point itself, instead of to the future RF locations. 


that fell at the future position of the RF relative 
to the upcoming fixation point after the sac- 
cade. The researchers thus suggested that the 
RF was “updated” or “remapped” to the new 
position, perhaps to compensate perceptually 
for the upcoming eye movement. Similar spa- 
tial updating was subsequently found in other 
brain structures that have mixed visual and 
oculomotor (eye-movement) function, such as 
the frontal eye fields* (FEFs) and the superior 
colliculus’, suggesting a common compensa- 
tory mechanism. 

A strongly held notion about the spatial- 
updating hypothesis is that the brain’s entire 
representation of the visual field rigidly trans- 
lates in preparation for the abrupt movement 
of the eyes. However, in the original studies of 
spatial updating, visual stimuli were focused 
almost exclusively on the expected upcoming 
location of the RE. Zirnsak and colleagues 
have now widened the net in the simplest 
way imaginable: they presented visual 
stimuli to macaque monkeys not only at the 
upcoming (post-saccade) location of the RF of 
FEF neurons, but at various positions all over a 
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computer screen, to map the entire RE 

Surprisingly, when the full RFs were 
revealed, the authors found that these fields 
did not rigidly translate before the eye move- 
ment, but instead transiently shifted en masse 
towards the location of the upcoming saccade 
target (Fig. 1c; see also Fig. 3a of the paper’). 
Some of the shifts of individual RFs were sub- 
stantial, up to 18 degrees of visual angle. More- 
over, in the few cases in which the original 
RF was located farther away from the origi- 
nal fixation point than the saccade target (as 
measured along the direction of the saccade), 
the shift in the RF to the saccade-target loca- 
tion was actually in the opposite direction to 
that expected for a rigid translation of the RF 
relative to the fovea. 

The discovery that RFs collapse onto the 
saccade target is drastically different from the 
original spatial-updating hypothesis. So why 
did the authors of the original studies find 
neuronal responses at the upcoming position 
of the RF? In fact, the RFs of neurons in the 
FEF (and related visual-oculomotor struc- 
tures) are quite large, so even if the RFs 


collapsed to the saccade target, the fringes of 
many of those RFs would have probably over- 
lapped with the upcoming RF location, and 
would thus respond to stimuli at that location. 
Indeed, in the original studies, responses at the 
upcoming RF location were often quite weak, 
which is consistent with the typically weaker 
responses at the periphery of RFs. 

Zirnsak and colleagues’ findings raise many 
issues. For one, RF shifts to a saccade target 
resemble the RF shifts that occur towards tar- 
gets on which our attention is focused, even 
if the eyes do not move*”. The two phenom- 
ena are probably related, inasmuch as shifts 
of attention to targets of interest precede eye 
movements. In this perspective, visual stability 
during saccades could result from the fact that 


SOLAR SYSTEM 


Stranded in 


we effectively ignore those parts of the visual 
scene that are away from the saccade target. 

Moreover, spatial-perceptual distortions are 
known to occur owing to attention shifts and 
eye movements. These could be interesting 
phenomena to test further in monkeys, which 
can signal complex perceptions in experimen- 
tal settings. More studies will also be needed to 
understand the cellular mechanisms of rapid 
RF modulation before saccades. But for experts 
and non-experts alike, Zirnsak et al. provide us 
with a valuable lesson: interesting things are 
often revealed when we search for our prover- 
bial lost keys away from the streetlight. m 


John A. Assad is at the Center for 
Neuroscience and Cognitive Systems, Istituto 


no-man’s-land 


The discovery of a second resident in a region of the Solar System called the inner 
Oort cloud prompts fresh thinking about this no-man’s-land between the giant 
planets and the reservoir of comets of long orbital period. SEE LETTER P.471 


MEGAN E. SCHWAMB 


decade after its discovery, Sedna' still 
Aw: one of the strangest objects in 

the Solar System. This remote icy body 
is on a highly eccentric orbit that extends to 
about 1,000 astronomical units (Au; 1 AU 
is the mean Earth—Sun distance) and has a 
perihelion (point of closest approach to the 
Sun) of 76 Av. Its orbit is well beyond the 
reach of Neptune, which is located at 30 au, 
and is a long way from the edge of the Solar 
System where the Oort cloud, the reservoir 
of long-orbital-period comets, resides at 
about 10,000 au. Although other potential 


Asteroid 
belt 


Figure 1 | The inner Oort cloud. Trujillo and Sheppard’ have detected an 
object, called 2012 VP,,;, that joins Sedna as the second confirmed body of the 
inner Oort cloud, a region believed to lie between the disk-shaped Kuiper belt 
of icy bodies and the spherical Oort cloud of long-orbital-period comets. Sedna 


candidates have been found’, Sedna had 
remained the solitary confirmed member ofa 
proposed inner Oort cloud beyond 70 au. On 
page 471 of this issue, Trujillo and Sheppard 
report the discovery of an object, called 
2012 VP,,;, which joins Sedna as the second 
confirmed member of the inner Oort cloud. 
The finding solidifies the existence ofa popula- 
tion of icy bodies probably ranging in size from 
a few toa thousand kilometres. 

To all intents and purposes, in the current 
architecture of the Solar System, Sedna and 
2012 VP, ,; should not be there. These objects 
are in a no-man’s-land between the giant 
planets and the Oort cloud where nothing 
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Italiano di Tecnologia, Rovereto 38068, Italy, 
and the Department of Neurobiology, Harvard 
Medical School, Boston, Massachusetts. 
e-mail: jassad@hms.harvard.edu 
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in the known configuration of the modern- 
day Solar System could have emplaced them. 
Effectively frozen in place and untouched as 
the Solar System evolved to its present state, 
their orbits preserve the dynamical signature 
of whatever event scattered these bodies to 
such distances and detached them from the 
giant-planet region. With Sedna’s discovery 
in 2003, several formation mechanisms were 
put forth, each predicting different orbital 
distributions of inner Oort cloud objects. But 
with only a single object, it was impossible to 
unambiguously distinguish between the dif- 
ferent hypotheses. 

Proposed mechanisms to explain the origin 
of the inner Oort cloud include the scattering 
of planetesimals (the building blocks of planet 
formation) by a distant planet beyond Neptune 
that may have been ejected from our Solar 
System, or by the passage of a single star, located 
at between 500 and 1,000 au, early on in the 
Solar System’s history®’. The preferred mecha- 
nism is the gravitational scattering of leftover 
planetesimals from close encounters between 
the nascent Sun and other members of its 
birth star cluster, which would have contained 
between 10 and 1,000 stars®**"°. Most stars are 
born in stellar nurseries that dissolve within 
a few million years. At the present age of the 


Oort cloud (becomes 
spherical beyond about 5,000 au) 


it 0 | 
10,000 au 
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is roughly 1,000 km across, whereas 2012 VP; is estimated to be about 400 km 
in diameter. Perihelion is the orbital point of closest approach to the Sun; 
aphelion is the orbital point farthest from the Sun; 1 au is the mean Earth-Sun 
distance. Objects in this diagram are not to scale. (Figure adapted from ref. 12.) 
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Solar System, the Sun and other stars of its birth 
cluster would be spread across the Galaxy. There 
is no direct way to identify whether the Sun was 
ever in a stellar nursery or to find its siblings. 
But indirectly, the mark from those early stellar 
encounters, like fingerprints at a crime scene, 
could provide evidence for the sculpted distri- 
bution of orbits in the inner Oort cloud. 

Trujillo and Sheppard’s detection of 
2012 VP,,;, which was made using the Dark 
Energy Camera (DECam) at the Cerro Tololo 
Inter-American Observatory 4-metre tele- 
scope in Chile, brings us one step closer to 
reading the dynamical record of the inner Oort 
cloud. With its farthest distance from the Sun 
at 452 au and coming no closer into the Solar 
System than 80 Au, 2012 VP,,; is placed well 
within the expected inner Oort cloud out- 
side the Kuiper belt, a disk-shaped region of 
small icy bodies that lies beyond Neptune’s 
orbit (Fig. 1). 

It is still true that, with two objects, we 
cannot unambiguously identify the origin 
of the inner Oort cloud, but Trujillo and 
Sheppard find that the orbits of 2012 VP,,; and 
Sedna are consistent with models for the Sun’s 
birth cluster and constraints from previous 
ground-based, wide-field surveys''. This sug- 
gests that Sedna and 2012 VP,,; are the tip of 
the iceberg for this population of distant inner 
Oort cloud objects. Most objects in the 
inner Oort cloud would reside at distances 
farther than Sedna and 2012 VP,,;. For only 
an extremely small fraction of their orbits 
around the Sun would these inner Oort objects 
be bright enough to be detected in current 
ground-based surveys. 

But there is more to the authors’ study than 
described so far. Trujillo and Sheppard noticed 
a peculiarity with the orbits of Sedna and 2012 
VP,,3. The two objects have similar values 
for one of their orbital parameters: the angle 
between the point of perihelion and where 
the orbit crosses the plane of the Solar System. 
Interestingly, the most distant Kuiper-belt 
objects, with orbital semimajor axes greater 
than 150 au and perihelia beyond Neptune, 
also seem to have values for such angles com- 
parable to those of Sedna and 2012 VP,,3. 
Such clustering of orbital angles seems to be 
unexplainable by the gravitational influence 
of Neptune alone. This result may be the first 
hint we have of an identifiable signature of the 
inner Oort cloud’s formation mechanism on 
the orbits of closer-in Solar System bodies. If 
true, any formation mechanism proposed for 
the origin of Sedna and 2012 VP,,; will need 
to explain this orbital structure. More detec- 
tions of objects both in the Kuiper belt and the 
inner Oort cloud will be needed to confirm 
this result. 

Our knowledge of the inner Oort cloud is, 
in many ways, in the same state as the study 
of the Kuiper belt was in the 1990s, when the 
first Kuiper-belt objects were discovered — 
62 years after Pluto’s detection. 2012 VP,,; is 


the smoking gun for the existence of the inner 
Oort cloud. We now know that Sedna is not 
alone. The prospects of future detections using 
next-generation instruments and facilities, 
such as the Hyper Suprime-Cam on the 8.2-m 
Subaru telescope in Hawaii and the 8.4-m 
Large Synoptic Survey Telescope in Chile, 
are promising. With greater sky coverage and 
depth than that obtained by Trujillo and Shep- 
pard, we will find more of these distant bod- 
ies lurking in the shadows and will begin to 
unravel the origin of the inner Oort cloud. = 
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Methane minimalism 


A meta-analysis of methane emissions at the ecosystem level reveals a simple 
exponential dependence on temperature, despite the complex array of factors 
that control this process. SEE LETTER P.488 


TORI M. HOEHLER & MARC J. ALPERIN 


ethane is the third-largest contributor 
M: the greenhouse effect, after water 

vapour and carbon dioxide. The 
atmospheric concentration of methane rose 
for much of the twentieth century, was steady 
between 1999 and 2006, and is now again ris- 
ing, at a rate of 0.4% annually’. The cause of 
this resumption is not fully understood, but it 
is probably related to a surge in methane emis- 
sions from wetlands — nearly half of global 
methane emissions comes from wetlands and 
rice paddies that are expected to be subject to 
temperature-related and other feedbacks from 
global climate change. Although a complex set 
of factors influences methane emission at the 
ecosystem level, Yvon-Durocher et al. 2 report in 
this issue (page 488) that the average response 
of methane emissions to temperature across a 
range of ecosystems is well described by a sim- 
ple mathematical relationship. Could it be as 
straightforward as that? 

Most natural methane emissions originate 
with microorganisms called methanogens, 
the metabolic rates of which, during growth 
in culture with unlimited substrate, change 
with temperature according to the Arrhenius 
equation — a simple exponential dependence 
of rate constant on the reciprocal of absolute 
temperature. Such behaviour is unsurprising: 
although methanogenesis comprises a network 
of enzyme-catalysed reactions, the Arrhenius 
dependence reflects the kinetics of a single 
rate-limiting step. But Yvon-Durocher and 
colleagues’ report of the same temperature 
dependence at the ecosystem level is notable, 
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because an array of physical, chemical and 
ecological factors controls the production of 
methane and its release to the atmosphere’. 

Methanogenesis in soils and sediments is 
ultimately fuelled by complex organic matter; 
what proportion of this organic matter is con- 
verted to methane, and at what rate, depends 
as much on ecosystem dynamics as on the 
enzyme kinetics of methanogens (Fig. 1). For 
example, organic carbon can be converted to 
carbon dioxide, rather than methane, when 
oxidants such as oxygen, nitrate, iron(111) and. 
sulphate are available to fuel methanogens’ 
microbial competitors. And when organic 
matter does get converted to methane, it must 
first be broken down by other microbes into 
the few simple substrates that methanogens 
can metabolize — an upstream supply that can 
limit the rate of methanogenesis. 

Transport of methane from the site of 
production to the atmosphere is also subject to 
effects that might obscure a purely biochemi- 
cal temperature dependence. Methane is con- 
sumed in both aerobic and anaerobic microbial 
processes, and transport through zones in 
which such consumption occurs can markedly 
reduce net emissions. But consumption can 
largely be bypassed when physical and meta- 
bolic factors combine to promote ebullition — 
the formation and release of methane bubbles, 
a sight not uncommon in productive fresh- 
water lakes and swamps. The vascular system 
of plants can also facilitate methane transport, 
when roots penetrate the methane-producing 
portions of soils and provide a direct conduit to 
the atmosphere; but these same conduits also 
transport oxygen that inhibits methanogenesis 
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Figure 1 | Emissions network. Methane (CH,) is generated by microorganisms (methanogens) that 
metabolize substrates produced through the breakdown of complex organic matter by extracellular 
enzymes and fermentative microorganisms. The rate of methane emission to the atmosphere is influenced 
by these microorganisms individual sensitivities to temperature (indicated by E,), and by chemical 
conditions, such as oxygen availability, that divert the flow of carbon to microbial competitors that oxidize 
organic matter to carbon dioxide. Methane emission also depends on gas transport by diffusion, by bubble 
ebullition and in the vasculature of plants, and on the fraction of methane that is consumed by methane- 
oxidizing microorganisms. Despite this complex array of factors, Yvon-Durocher et al.’ report that the 
response of methane emissions at the ecosystem level can be described by the simple Arrhenius relationship. 


and promotes methane consumption. 

In their analysis of 127 studies of the eco- 
system-level dependence of methane emission 
on temperature, Yvon-Durocher et al. acknowl- 
edge this complex array of factors, but conclude 
that the aggregate temperature response is 
nonetheless described by the Arrhenius equa- 
tion, with an apparent activation energy (E,) 
of 0.96 electronvolts, similar to the 1.10 eV 
observed in pure cultures of methanogens. E, is 
a measure of temperature sensitivity; for exam- 
ple, 0.96 and 1.10 eV correspond, respectively, to 
a 3.5- and 4.2-fold increase in rate constant for 
an increase in temperature from 20°C to 30°C. 

Statistically speaking, the large number of 
studies considered allows for a confident state- 
ment that the calculated mean E, (0.96 eV) 
accurately reflects the mean temperature 
sensitivity of methane-emitting ecosystems 
— assuming that the sites that comprise the 
data set represent a random sample of all such 
environments. But the impact of factors other 
than temperature seems evident in the scatter 
and spread of the individual data sets consid- 
ered by the authors. For example, about 40% 
of the studies considered had Arrhenius-plot 
correlation coefficients (r’) of less than 0.5, 
which indicates that less than half of the vari- 
ance in those emission data is explained by the 
Arrhenius relationship, and about 10% of the 
studies measured methane emissions that were 
higher at lower temperatures (opposite to the 
effect predicted by the Arrhenius equation). 

The reported ecosystem-level E, is higher 
than what has been called the “universal tem- 
perature dependence” of aerobic metabolism* 
—an E, of 0.67+0.15 eV that encompasses 
the metabolism ofa wide range of plants, pro- 
tozoa, invertebrates and vertebrates — and 
is also higher than the average E, (0.72 eV) 
observed for a diverse group of 50 aerobic and 


anaerobic microorganisms’. The higher average 
E, reported here for methanogenic ecosystems 
could reflect either that the biochemistry of 
methanogens (which have an average E, of 
1.10 eV) directly limits methane emissions in 
some ecosystems, or that the organisms that 
supply methanogens with substrates have simi- 
larly high temperature dependence. Neverthe- 
less, the clear implication of these findings is 
that methane production will increase more 
steeply with temperature than would be cap- 
tured by climate-change models that assume 
methane emission is governed by more typi- 
cal (lower) values of E,. For example, over the 
range of global warming projected’ for this 
century (1.0-3.7 °C), an E, of 0.96 eV suggests 
a 14-63% increase in methane emission com- 
pared with 10-40% for an E, of 0.67 eV. 
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Yvon-Durocher and colleagues’ findings 
constrain, and perhaps simplify, one piece of a 
much larger climate-change puzzle. Feedbacks 
from methane emissions in response to global 
climate change will ultimately derive from a 
combination of the direct temperature effects 
considered here and indirect effects such 
as thawing of permafrost (and the resultant 
availability of new organic matter), changes in 
vegetation, and large-scale inundation or dry- 
ing of soils and wetlands. Moreover, methane’s 
proportionate contribution to global warming 
(about 20% over the past century) may actually 
diminish as carbon dioxide takes an increas- 
ingly prominent role in the future’. But in this 
complex problem — unlike methane’s flux to 
the atmosphere — every bit helps. = 
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A high bar for decadal 
forecasts of El Nino 


Climate simulations suggest that multi-decadal periods of high and low 
variability in the phenomenon known as the El Nifio-Southern Oscillation 
in the tropical Pacific Ocean may be entirely unpredictable. 


PEDRO DINEZIO 


he episodic warming and cooling of 
the surface temperature of the tropical 
Pacific Ocean, known as the El Nifio-— 


Southern Oscillation (ENSO), causes year-to- 
year climate fluctuations, affecting weather, 


ecosystems and economies around the world. 
The occurrence of these episodes is not regu- 
lar. For example, whereas the period covering 
the years 1970-2000 witnessed the strong- 
est El Nifio (warming) events on record, the 
years since 2000 have experienced fewer and 
weaker such events. Writing in the Journal of 
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Figure 1 | Decadal forecasts of El Nifio-Southern Oscillation activity. 

Using their GFDL-CM2.1 climate model, Wittenberg et al.' performed 
simulations of the variability of the El Nifio-Southern Oscillation (ENSO), here 
quantified using the Nifio-3.4 sea surface temperature (SST) index. Two distinct 
epochs are shown, characterized by high (a) or low (b) ENSO variability in the 
model’ control run (black curves). For each epoch, 3 sets of 40 simulations 


Climate, Wittenberg et al.' make the case that 
these multi-decadal epochs of enhanced and 
subdued ENSO activity occur randomly and 
therefore may be unpredictable. 

Changes in ENSO behaviour from decade 
to decade are commonly seen in historical 
observations and palaeoclimate proxy 
records” *. These variations were first put into 
context by Wittenberg in an earlier study’, 
which examined a 2,000-year simulation based 
on a fairly realistic climate model, known as 
GFDL-CM2.1. This concluded that decadal- to 
centennial-scale changes in ENSO behaviour 
can be internally generated by the model in 
the absence of any external forcing, such as 
increases in greenhouse-gas concentration or 
variations in solar output. 

Predicting whether the coming decades will 
bring an onslaught of strong ENSO events — 
or none at all — is crucial because of the impact 
of such events on weather patterns around the 
world. Individual episodes may be predicted 
up to two years in advance’, but on larger time- 
scales our ability to forecast ENSO behaviour 
accurately may hinge on how ENSO responds 
to changes in the background climate system. 
This idea is supported by studies suggesting 
that the level of activity could be related to 
natural or man-made changes in the climate 
of the tropical Pacific®”. 

In their latest study, Wittenberg et al.’ used 
the same GFDL-CM2.1 model, this time to 
forecast epochs of high and low ENSO activ- 
ity. For each epoch of activity in the model's 
control run, the authors performed 40 fore- 
casts, each differing by a tiny perturbation 
of the size of the computer's rounding error 
to one of the model’s numerical grid points. 
These ‘perfect model forecasts have the best 


T 
1165 
Model years 


chance of reproducing the extreme ENSO 
epochs seen in the control run, and permit 
assessment of the model’s intrinsic ability to 
predict them. 

The authors found that, beyond the first 
two to four years after initialization of the 
forecasts, the multi-decadal epochs of high 
and low ENSO activity are completely unpre- 
dictable. For each epoch, the model forecasts 
either active or quiet events with the same 
probability. That is, the perturbations can 
alter the forecasts in such a way that the model 
is capable of forecasting an inactive ENSO 
decade where it originally simulated a highly 
active one (Fig. 1a), or an active decade where 
it simulated a quiet one (Fig. 1b). 

This is the ‘butterfly effect’ of chaos theory”” 
applied to ENSO events. Seemingly small per- 
turbations to a system, such as the flapping of 
a butterfly’s wings, may lead to large changes 
in that system. It is a sobering finding, because 
it suggests that the changes observed in ENSO 
behaviour during the twentieth century could 
very well be random fluctuations unrelated to 
natural or man-made changes in the climate of 
the tropical Pacific. 

Further research is needed to determine 
whether the study’s conclusions can be 
extrapolated from the model world to the real 
world. During the past decade, climate models 
have progressed substantially in their ability to 
simulate ENSO events. Many models can now 
emulate the long-term modulation first seen in 
the GFDL-CM2.1 simulations, possibly owing 
to the inclusion of improved wind patterns”. 
But it is not known whether even the best cli- 
mate models simulate the correct mix of the 
myriad processes that influence ENSO. One 
cause of uncertainty might be that the decadal 
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are initialized from model years 1721, 1731 and 1741, and from 1151, 1161 

and 1171, respectively (faint grey curves). Model years do not coincide with 
historical years. The simulations differ in only one tiny perturbation applied to 
the model's initial conditions. For both epochs, at least one simulation out of 40 
exhibits high (red curves) or low (blue curves) ENSO variability, indicating that 
the level of activity is unpredictable. (Graphic courtesy of Andrew Wittenberg.) 


fluctuations in the background climate, which 
are thought to be the source of ENSO predict- 
ability’, are too weak in the models simula- 
tions’. Conversely, models simulate activity 
that is much stronger than observed®", so 
this too-strong ENSO might be oblivious to 
the too-weak changes in background climate, 
resulting in decreased predictability. The real- 
ism of the simulations must be improved if 
model-based conclusions are to be applied to 
the real world. 

Existing observational records are not yet 
long enough for us to investigate whether, and 
how, ENSO responds to long-term climate 
fluctuations that could be sources of predict- 
ability. Progress on this front depends on 
maintaining and expanding our observational 
capability in the ocean, which relies on arrays 
of autonomous profiling floats and tropical 
moorings. In the meantime, results such as 
those of Wittenberg et al. are reminders of the 
challenges associated with forecasting ENSO 
changes. Future attempts to attribute the 
causes of individual events and their decadal 
variations now face a much higher bar. = 
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A protective factor 
for the ageing brain 


What guards the aged brain against neurodegeneration? A study finds that the 
REST protein has a central role in protecting ageing neurons from death and in 
maintaining cognitive capacity in the elderly. SEE ARTICLE P.448 


LI-HUEI TSAI & RAM MADABHUSHI 


Izheimer’s disease is the leading cause 

of dementia. In the United States, an 

estimated 13% of people aged over 65, 
and nearly one-third of those aged over 85, are 
affected by this disorder’. As the population 
ages, the number of people with the disease is 
expected to rise precipitously, with no effec- 
tive therapeutic strategy in sight. A central 
question is why certain individuals who dis- 
play the anatomical and molecular features of 
Alzheimer’s disease still preserve their cogni- 
tive abilities. On page 448 of this issue, Lu et al? 
provide some answers. 

The authors demonstrate that a protein 
called repressor element 1| silencing tran- 
scription factor (REST), which is normally 
expressed at low levels in the neurons of young 
human brains, is profoundly elevated in aged 
brains. By contrast, they observed that REST 
levels are markedly reduced in the nucleus 
of neurons in patients with mild cognitive 
impairment (a condition that often precedes 
dementia) and Alzheimer’s disease. Lu and col- 
leagues also analysed post-mortem samples of 
prefrontal cortex taken from people who had 
previously undergone neuropsychiatric assess- 
ments’, and found that nuclear REST levels 
correlated positively with cognitive function. 
Taken together, these findings underscore 
the importance of maintaining REST protein 
levels in the aged brain for preventing cogni- 
tive decline and Alzheimer’s disease. 

Previous gene-expression studies* ° have 
extensively characterized the changes that 
occur in the ageing brains of healthy individ- 
uals and those with Alzheimer’s. Generally, 
in both groups, neuronal genes implicated 
in synaptic transmission (signalling at neu- 
ral junctions), in calcium signalling and in 
the functioning of inhibitory neurons show 
decreased expression, whereas genes asso- 
ciated with stress responses, DNA-damage 


responses, immune responses and apoptotic 
cell death show increased expression. But there 
are also crucial differences. For instance, toxic 
stimuli that are characteristic of Alzheimer’s- 
related neurodegeneration induce the blockage 
of gene expression associated with neural plas- 
ticity (changes in neural pathways and junc- 
tions) in mouse models of Alzheimer’s and in 
the brains of people with the disease’. 

In their study, Lu et al. used a dazzling array 
of experimental approaches to demonstrate 
that the loss of neuroprotective REST func- 
tions contributes to neuronal vulnerability 
in the brains of those with Alzheimer’s. The 
authors observed that, in the brains of healthy 
aged individuals, nuclear REST both targets 
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and suppresses several pro-apoptotic genes, 
as well as certain genes that encode enzymes 
involved in the pathology of Alzheimer’s. 
But in diseased brains this suppression is 
lost, resulting in the induction of genes that 
are likely to underlie aspects of neuronal loss 
and neurodegeneration (Fig. 1). 

The authors also report that REST-deficient 
mice show no neuronal loss at 1 month old, 
but they do by 8 months. Similarly, the authors 
found that Caenorhabditis elegans worms 
deficient in spr-1, spr-3 and spr-4 (genes that 
evolved from the same ancestral gene as REST) 
are more sensitive to oxidative stress and have 
shorter lifespans than wild-type worms. There- 
fore, REST-mediated transcriptional repres- 
sion confers resistance to neurotoxic stress in 
several scenarios. 

When the researchers treated cultured 
human neurons with hydrogen peroxide 
(an oxidizing agent), and then administered 
some of the resulting culture medium to 
untreated cells, they observed a clear increase 
in the expression of REST messenger RNA by 
those cells. This suggests that a soluble REST- 
inducing factor is produced by neurons upon 
oxidative stress. Furthermore, the authors 
demonstrated that extracts of aged, but not 
young, human prefrontal cortex robustly 
induce REST expression in neural cells in vitro. 
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Figure 1 | REST activation distinguishes healthy aged brains from the brains of those with 
Alzheimer’s disease. a, Lu et al.’ report that various neurotoxic stimuli in the ageing brains of healthy 
people cause increased expression of the REST protein, an effect that is at least partially dependent on 

the Wnt-signalling pathway. REST induction activates neuroprotective stress-response pathways, while 
suppressing neuronal apoptotic cell death and the toxic effects of the accumulation of amyloid-B (AB, the 
main protein component of the plaques found in the brains of patients with Alzheimer’s disease) and of 
elevated oxidative stress. b, In the brains of those with Alzheimer’s disease, both Wnt signalling and REST 
induction are suppressed, leading to neurodegeneration. 
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Conversely, they found that extracts of cortex 
from those affected by Alzheimer’s had 
reduced REST-inducing activity compared 
with age-matched controls. 

REST is a target of the Wnt-B-catenin sig- 
nalling pathway’. Lu et al. showed that REST 
can be induced by Wnt-3a and Wnt-7a (two 
proteins from the Wnt family), and that a Wnt- 
receptor antagonist partially inhibits REST 
induction. These results suggest that stress can 
increase Wnt signalling, which, in turn, induces 
REST expression. It remains to be determined 
whether Wnt is elevated in only some parts 
of the brain to influence REST — the authors 
report evidence of Wnt signalling in the hippo- 
campus and the frontal cortex, but not in the 
cerebellum, for example. The question of 
whether the observed changes in Wnt signal- 
ling originate from neurons or from other cells 
in the brain also deserves further investigation. 

Lu and colleagues went on to show that loss 
of nuclear REST is associated with a substantial 
increase in the expression of genes implicated 
in apoptosis and Alzheimer’s pathology in both 
the least severe (Alzheimer’s disease 1, AD1) 
and most severe (AD2) forms of the disease. 
Unexpectedly, however, they observed that 
another set of genes targeted by REST — those 
involved in synaptic transmission and other 
functions of neural junctions — initially show 
increased expression in AD1, but then reduced 
levels in AD2. It is plausible that the increased 
expression of these genes in AD1 is a com- 
pensatory mechanism to maintain neuronal 
homeostasis. Further analysis of gene expres- 
sion in specific subtypes of neural cells should 
help to clarify precisely how the observed 
gene-expression patterns arise. 

The authors’ observations also raise the 
question of why gene-expression profiles 
differ in AD1 and AD2. Further characteriza- 
tion of the environment of the affected genes’ 
chromatin (the DNA-protein complex in 
which genetic material is organized) and of the 
binding of other transcription factors in the 
genes’ regulatory regions in the brains of the 
young, the aged, and those with AD1 or AD2, 
should provide greater insight into this matter. 

Excitingly, this study provides the first 
detailed investigation of molecular markers 
in the brain that differentiate between popu- 
lations of the young, the aged and those with 
Alzheimer’s. Furthermore, by showing that 
ageing in the brain might be associated with 
the activation of a specific stress-response 
program, it implies that sustained mainten- 
ance of this program confers protection from 
neurodegeneration. Indeed, all the healthy 
centenarians studied in the research showed 
uniformly high levels of REST. 

Could therapeutically stimulating nuclear 
REST activity in the brain prevent Alzheimer’s 
and other degenerative diseases? On the basis 
of Lu and co-workers’ results, one strategy 
would be to activate Wnt signalling in aged 
individuals. However, such activation is also 


implicated in the development of various 
cancers’, and so this approach would probably 
require careful targeting of Wnt activation in 
the brain. Alternative strategies include find- 
ing either Wnt-independent REST activators 
or small molecules that prevent the export of 
REST from the nucleus. A deeper understand- 
ing of the molecular mechanisms that govern 
REST activation in the ageing brain will be 
crucial for such efforts to be successful. m 
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Chaos in the cold 


A marriage between theory and experiment has shown that ultracold erbium 
atoms trapped with laser light and subjected to a magnetic field undergo 
collisions that are characterized by quantum chaos. SEE LETTER P.475 


PAUL S. JULIENNE 


he use of magnetic fields to manipulate 

the interactions of ultracold atoms at 
temperatures below one microkelvin 

has allowed the experimental and theoretical 
study of a plethora of exotic phenomena in 
quantum physics’. The report by Frisch et al.’ 
on page 475 of this issue adds an entirely new 
twist to this work. Until now, such research has 
mainly used simple atoms that have relatively 
simple interactions. Frisch and colleagues 
worked instead with a complex atomic species, 
erbium, and combined theory and experiment 
to demonstrate the signature of quantum chaos 
in the collisions between two erbium atoms. 
The results offer the prospect of exploring new 
avenues of ultracold physics and chemistry 
using complex atomic and molecular species. 
In their study, Frisch et al. used a tightly 
focused laser to trap samples of around 
100,000 erbium (Er) atoms in their lowest- 
energy quantum state at a temperature of about 
400 nanokelvins. The samples consisted either 


of bosonic isotopes (‘Er or Er), which have 
integer spin, or of a fermionic one ('’Er), 
which has half-integer spin. They tuned an 
applied magnetic field to a fixed value between 
0 and 7 millitesla and counted the number of 
atoms remaining after holding them in the trap 
for 400 milliseconds. The authors found that 
the number of atoms left in the trap depended 
strongly on the particular value of the mag- 
netic field selected. Such atom losses are well 
known in cold-atom physics, and are used 
to locate features known as Feshbach reso- 
nances, which occur when the total (nearly 
zero) energy of two colliding atoms matches 
the magnetically tunable energy of a bound 
diatomic molecular state (Er,). Tabulating 
the magnetic-field values at which losses exist 
pinpoints the ‘positions’ of the resonances, 
from which the dynamics of the underlying 
collisions can be probed. 

Frisch et al. found that the number of 
resonances in their system is surprisingly 
large, much larger than obtained with alkali- 
metal species such as lithium, rubidium or 


> 


Magnetic field 


Figure 1 | A barcode of resonances. The lines mark the different values of an applied magnetic field at 
which erbium atoms from a trapped sample studied by Frisch et al.’ show increased loss from the trap. 
The distribution of spacing between adjacent lines, which locate features known as Feshbach resonances, 
indicates that collisions between two atoms have quantum chaotic behaviour. 
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caesium (Cs). They provide theoretical models 
that connect this behaviour with chaotic quan- 
tum dynamics of the collisions. Because alkali- 
metal atoms such as Cs have a simple atomic 
structure (they have zero electronic orbital 
angular momentum), the resonances of col- 
liding Cs atoms are spaced relatively far apart 
as a function of increasing magnetic field, and 
are characterized by simple quantum numbers. 
By contrast, an Er atom has a complex atomic 
structure that gives it a total electronic orbital 
angular momentum of five. Therefore, col- 
liding Er atoms have many more resonances 
than their Cs analogues, and they are ‘mixed’ 
— that is, they are no longer described by 
simple quantum numbers. 

To explain the nature of their measured 
resonance spectrum (Fig. 1), as well as 
that calculated by their model, Frisch et al. 
borrowed a tool known as random matrix 
theory from earlier work in nuclear and other 
fields of physics. By assuming that the vari- 
ous interactions of their system are described 
by the parameters of this theory, they could 
reproduce the average spacing between suc- 
cessive resonances. They found that the distri- 
bution of spacing for ’“Er or '°°Er bosons was 
much closer to one that corresponds to chaotic 
dynamics, known as a Wigner—Dyson distri- 
bution, than to the Poissonian distribution that 
characterizes regular, non-chaotic dynamics. 
The '’Er fermions, which also have nuclear 
spin, showed an even denser set of resonances 
than the bosonic isotopes, but the authors 
have not yet analysed this set in detail because 
of its greater complexity. A study that was 
reported’ this year measured similarly dense 
sets of resonances in systems of cold bosonic 
and fermionic isotopes of dysprosium (Dy). 

Why are these results significant? Let us 
consider what is special about cold collisions. 
Because cold atoms move slowly, on average 
more than 10,000 times more slowly than 
atoms at room temperature, the Heisenberg 
uncertainty principle ensures that the uncer- 
tainty in their position is large. The colliding 
atoms no longer behave like particles, but take 
on wave-like character, with a wavelength 
that is inversely proportional to their veloc- 
ity. These wavelengths can become large, 
one micrometre or more, much larger than 
the sub-nanometre length ofa chemical bond. 
In this case, collisions become simple and are 
characterized by a parameter known as the 
scattering length. 

Part of the power and beauty of cold-atom 
physics is that the scattering length can be 
made to take on any value by tuning a mag- 
netic field close to a Feshbach resonance’. Its 
value controls the two-body, few-body and 
many-body physics of ultracold quantum 
matter. Thus, controlling the field makes the 
system dance to our tune. Previously, the Fesh- 
bach resonances used for such control have 
been isolated resonances that vary with the 
field and the atomic kinetic energy in a simple 


and well-understood way. Now, cold-atom 
researchers have to figure out how to under- 
stand the scattering length associated with a 
dense set of chaotic resonances such as that 
observed by Frisch and colleagues. The varia- 
tion with field and kinetic energy will be more 
complex than previously encountered. The 
same is true of the atom-loss processes that 
are associated with such resonances and that 
determine the lifetime of a cold atomic system. 

Another reason for working with Er is that 
its magnetic dipole moment is seven times 
larger than that of alkali-metal atoms. The 
dipole moment for Dy is even larger. Such large 
dipole moments, which result in a long-range 
interaction between pairs of dipoles, could 
enable the realization of some of the rich vari- 
ety of phenomena predicted for an ensemble 
of cold dipoles*”. It will now be necessary to 
understand the interplay between the dense 
set of resonances and the long-range dipolar 
interactions. 

Finally, cold molecules — which have rota- 
tional, vibrational and other internal degrees 
of freedom — are expected to have much 
denser resonance spectra than Er or Dy atoms. 
Work is also being done to make systems of 
cold polar molecules, which can have much 
stronger dipolar interactions than atoms and 
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exhibit a larger range of dipolar phenomena. 
Therefore, Frisch and colleagues’ study is but a 
prelude to the interesting work that is to come 
on cold molecular systems, the resonances 
of which may have quite long lifetimes®. We 
can anticipate that the considerable body of 
work carried out on the internal relaxation 
and chemical reactions of excited molecu- 
lar complexes”* will come into play when 
chaos turns up at the heart of cold molecular 
collisions. = 
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The zinc link 


Increased influx of zinc into chondrocytes — the cells that make up cartilage — 
has been found to activate matrix- degrading enzymes that cause the destruction 


of cartilage in osteoarthritis. 


VIRGINIA BYERS KRAUS 


he classic wound-healing response 
consists of a cascade of events that can 
be arbitrarily divided into four phases: 
blood clotting and bleeding cessation; inflam- 
mation; cellular proliferation; and wound 
remodelling with scar-tissue formation’. 
Osteoarthritis, the most prevalent and recalci- 
trant form of arthritis, has been proposed to be 
the result ofa chronic wound-healing response 
in joints’. Articular cartilage, which is the tissue 
that lines joint surfaces, has no blood vessels, 
so wound healing at these sites begins with 
the inflammatory phase. This phase is char- 
acterized by an extensive auto-debridement 
response, in which dead or damaged tissue 
is removed. Writing in Cell, Kim et al.’ pro- 
vide compelling data that help to unravel the 
mechanistic details of this ‘clean-up’ phase, 
how it can progress to osteoarthritis, and the 
role of zinc in this process. 
Second only to iron in trace-element abun- 
dance in the human body, and well known as 


a component of wound-healing salves, zinc 
has long been implicated in wound healing 
through promoting auto-debridement, anti- 
infective activity and the re-formation of skin’. 
Now, however, Kim et al. show that zinc has 
a deleterious effect on cartilage through its 
ability to upregulate enzymes responsible for 
cartilage degradation in a process that bears 
a striking resemblance to auto-debridement 
during wound healing. 

The authors found that expression of the 
zinc-transporting protein ZIP8 increased 
in vitro in chondrocytes — the cells that make 
up cartilage — in response to IL-1, an inflam- 
matory cell-signalling molecule that is a key 
mediator of osteoarthritis. Overexpression of 
ZIP8 in chondrocytes led to increased levels 
of intracellular zinc and of zinc-dependent 
metalloprotease matrix-degrading enzymes 
that have been implicated in osteoarthritis, 
including MMP3, MMP9, MMP12, MMP13 
and ADAMTSS. These metalloproteases are 
key mediators of the auto-debridement phase 
of wound healing: they regulate inflammation; 
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degrade the tissue matrix surrounding cells, 
thereby facilitating cell migration to the area; 
and remodel the new extracellular matrix’. 

Kim et al. also found higher ZIP8 and intra- 
cellular zinc levels in the cartilage of humans 
with osteoarthritis, and in that of mice that 
model this condition, than in normal cartilage. 
Increasing or decreasing zinc-transport activity 
through cartilage-specific up- or downregula- 
tion of ZIP8 in mice respectively increased or 
decreased the severity of the disease. One of the 
factors mediating the effect of zinc seems to be 
the metal-sensing transcription factor MTF1 
— the authors observed that zinc enhanced the 
nuclear localization and transcriptional activ- 
ity of this protein, and that its overexpression 
led to increased expression of matrix metallo- 
proteases and to osteoarthritis. These findings 
are consistent with the recent demonstration® 
that ZIP8 levels influence the severity of osteo- 
arthritis. Taken together with the knowledge 
that excessive metalloprotease activity con- 
tributes to the development of chronic skin 
wounds’, these data support the theory that 
osteoarthritis is an internal wound-healing 
process that engenders chronic joint disease. 

Because osteoarthritis is a complex disease 
that involves many interacting tissues, eluci- 
dating the temporal course of its develop- 
ment is challenging. Kim et al. show that 
modulation of ZIP8 is sufficient to give rise 
to typical osteoarthritis-related features in 
bone — a finding that supports the ‘cartilage 
first’ scenario in the long-standing debate 
regarding whether osteoarthritis develops 
first in cartilage or in bone’. Overexpression 
of ZIP8 also enhanced cartilage destruction in 
the absence of apparent inflammation of the 
synovial lining of the joint, further bolster- 
ing the idea that initial cartilage destruction 
triggers a cascade of disease-aggravating 
events in neighbouring joint tissues. This is 
consistent with the emerging model of the dis- 
ease, in which metalloproteases, in addition to 
chewing away cartilage, generate fragments of 
cartilage molecules that are perceived by the 
immune system as danger signals’, eliciting 
an inflammatory response that contributes to 
disease progression (Fig. 1). 

As demonstrated in mice, there is a correla- 
tion between wound healing, cartilage regen- 
eration and protection from osteoarthritis’. For 
example, mice of the ‘superhealer’ strain undergo 
faster cutaneous-wound healing, experience a 
less intense and shorter inflammatory response 
to joint injury, and develop less severe osteo- 
arthritis than non-superhealer mice’. Therefore, 
a better understanding of the phases and mecha- 
nisms underlying the wound-healing response is 
likely to reveal potential avenues for the treatment 
of osteoarthritis. 

How might the human wound-healing 
response be manipulated to resemble the 
superhealer phenomenon? Kim et al. specu- 
late that local depletion of zinc, inhibition of 
ZIP8 function and zinc influx and inhibition 
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Figure 1 | The downward spiral of osteoarthritis. 
Osteoarthritis is thought to occur when the 
wound-healing response to a joint injury fails to 
occur satisfactorily. This response begins with the 
activation of cell-signalling molecules, such as 
members of the IL-1 family, in chondrocytes. 

Kim et al.’ show that IL-1 induces expression 

of the protein ZIP8, triggering a series of events that 
lead to disease development. First, ZIP8 mediates 
an influx of zinc into the cells, translocating the 
metal-sensing transcription factor MTF1 to the 
nucleus and thereby enhancing the activity of zinc- 
dependent metalloprotease enzymes (MMPs) that 
induce cartilage breakdown. Fragments of cartilage 
resulting from this enzymatic activity, referred to as 
disease-associated molecular patterns (DAMPs), 
are then recognized as danger signals by immune 
cells®, inducing inflammation of the joint lining, 
known as synovitis, that feeds back to further 
aggravate the response. 


of MTF1 activity in cartilage might be effective 
therapeutic approaches. Indeed, inhibiting 
the target of zinc, the matrix metalloproteases, 
has long been considered a logical treatment 
strategy. Although early attempts at metallo- 
protease inhibition suffered from a lack of selec- 
tivity and were associated with musculoskeletal 
side effects'', more-selective metalloprotease 
inhibitors are showing promising therapeutic 
potential'’, as are strategies for inhibiting the 
zinc-ZIP8-MTF1 axis®. 

In all such studies, it will be important to 
consider the broader effects of zinc on the 
body. Might zinc supplementation in humans 
worsen osteoarthritis? And is it feasible to 
lower zinc levels in a joint without affecting 
systemic zinc availability? For successful thera- 
peutic targeting, we will also need to fill gaps 
in our understanding of the molecular basis of 
osteoarthritis. What are the initial events that 
upregulate ZIP proteins in cartilage? And how 
is IL-1, the inducer of ZIP8, itself induced? 

Mechanical injury of cartilage is a potent 
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inducer of metalloproteases, but not of IL-1 Be; 
whereas skin wounding induces the local 
release of IL-1la, which increases intracellular 
IL-1 in chondrocytes'’. Because IL-1a has 
been proposed as the key intermediary in 
the early stages of the disease, with IL-1 the 
dominant mediator in more advanced stages", 
it would be interesting to determine whether 
IL-1a can also activate the zinc-ZIP8-MTF1 
axis. Finally, given that ZIP14, a close homo- 
logue of ZIP8 and a zinc and iron transporter, 
is implicated’* in haemochromatosis (an iron- 
accumulation condition that is also a cause of 
osteoarthritis), additional study of metal trans- 
porters in general may yield further insight 
into the pathways to osteoarthritis. 

A healed wound has been described’ as “one 
that has been returned to a normal anatomi- 
cal structure, function and appearance of the 
tissue within a reasonable period of time”. But 
we do not know how often joints completely 
return to normal after injury, or if they ever 
do’’. In humans, osteoarthritis is diagnosed 
only once it has progressed to revealing overt 
signs of disease, such as cartilage loss, and 
such cases presumably reflect joints that have 
undergone an unresolved wound-healing 
response. Perhaps our growing understanding 
of the molecular underpinnings of this disease 
will allow signs of osteoarthritis to be detected 
earlier in the wound-healing battle, so that this 
response can be turned off sooner or chan- 
nelled into a regenerative response, thereby 
preventing disease development. = 
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The ultimate physical limits of privacy 


Artur Ekert'? & Renato Renner® 


Among those who make a living from the science of secrecy, worry and paranoia are just signs of professionalism. Can we 
protect our secrets against those who wield superior technological powers? Can we trust those who provide us with tools 
for protection? Can we even trust ourselves, our own freedom of choice? Recent developments in quantum cryptog- 
raphy show that some of these questions can be addressed and discussed in precise and operational terms, suggesting 
that privacy is indeed possible under surprisingly weak assumptions. 


pher, once wrote “... it may be roundly asserted that human inge- 

nuity cannot concoct a cipher which human ingenuity cannot 
resolve ...”’. Is it true? Are we doomed to be deprived of our privacy, no 
matter how hard we try to retain it? If the history of secret communi- 
cation is of any guidance here, the answer is a resounding ‘yes’. There is 
hardly a shortage of examples illustrating how the most brilliant efforts 
of code-makers were matched by the ingenuity of code-breakers*. Even 
today, the best that modern cryptography can offer are security reduc- 
tions, telling us, for example, that breaking RSA, one of the most widely 
used public key cryptographic systems, is at least as hard as factoring 
large integers’. But is factoring really hard? Not with quantum technology. 
Indeed, RSA, and many other public key cryptosystems, will become inse- 
cure once a quantum computer is built*. Admittedly, that day is probably 
decades away, but can anyone prove, or give any reliable assurance, that 
it is? Confidence in the slowness of technological progress is all that the 
security of our best ciphers now rests on. 

This said, the requirements for perfectly secure communication are 
well understood. When technical buzzwords are stripped away, all we 
need to construct a perfect cipher is shared private randomness, more 
precisely, a sequence of random bits known asa ‘cryptographic key’. Any 
two parties who share the key, we call them Alice and Bob (not their real 
names, of course), can then use it to communicate secretly, using a simple 
encryption method known as the one-time pad®. The key is turned into a 
meaningful message by one party telling the other, in public, which bits of 
the key should be flipped. An eavesdropper, Eve, who has monitored the 
public communication and knows the general method of encryption but 
not the key will not be able to infer anything useful about the message. It is 
vital though that the key bits be truly random, never reused, and securely 
delivered to Alice and Bob, who may be miles apart. This is not easy, but 
it can be done, and one can only be amazed how well quantum physics 
lends itself to the task of key distribution. 

Quantum key distribution, proposed independently by Bennett and 
Brassard® and by Ekert’, derives its security either from the Heisenberg 
uncertainty principle (certain pairs of physical properties are comple- 
mentary in the sense that knowing one property necessarily precludes 
knowledge about the other) or the monogamy of quantum entanglement 
(certain quantum correlations cannot be arbitrarily shared). At first, the 
idea of using quantum phenomena to improve secrecy was nothing more 
than an academic curiosity, but over time, with the progress of quantum 
technologies, it was embraced by experimental physicists and eventually 
turned into a viable commercial proposition. But even though quantum 
cryptography can offer the best security available at present, it is not immune 
to attacks exploiting botched implementations (see, for example, refs 8-11 


E dgar Allan Poe, an American writer and an amateur cryptogra- 


for practical illustrations). The flaws in the design may be unintentional, 
the result of ignorance or negligence on the part of some honest indivi- 
duals who design quantum cryptosystems; but they can also be malicious, 
secretly implanted by powerful adversaries. Should we not then dissect 
our cryptographic devices, analyse them and make sure that they do exactly 
what they are supposed to do? Given that some of the flaws may be unknown 
to us, what exactly should we be looking for? It has long been believed that 
here we reach the limits of privacy, and that at this point whoever is more 
technologically advanced, be it the NSA, GCHQ or some other agency, 
has the upper hand. Surprisingly, this is not the case. 

Recent research shows that privacy is possible under stunningly weak 
assumptions. All we need are monogamous correlations and a little bit of 
‘free will’, here defined as the ability to make choices that are indepen- 
dent of everything pre-existing and are hence unpredictable’*’*. Given 
this, we can entertain seemingly implausible scenarios. For example, devices 
of unknown or dubious provenance, even those that are manufactured 
by our enemies, can be safely used to generate and distribute secure keys. 
There are caveats, of course: the devices must be placed in well-isolated 
locations to prevent any leaks of the registered data, and the data must be 
analysed by a trusted entity. Barring this, once the devices pass a certain 
statistical test they can be purchased without any knowledge of their inter- 
nal working. This is a truly remarkable feat, also referred to as “device- 
independent’ cryptography'* °. Needless to say, proving security under such 
weak assumptions, with all the mathematical subtleties, is considerably more 
challenging than in the case of trusted devices, but the rapid progress in the 
past few years has been very encouraging, making device-independent cryp- 
tography one of the most active areas of quantum information science. 

In fact, some of the device-independent schemes do not even rely on 
the validity of quantum theory’'™, and they therefore guarantee security 
against adversaries who may have access to superior, ‘post-quantum’, tech- 
nologies. The adversaries may even be given control over the choices made 
by Alice and Bob during the key distribution protocol’. As long as this 
control is not complete, Alice and Bob can do something about it. It turns 
out that ‘free will or, more specifically, the ability to make unpredictable, 
and, therefore, random, choices can be amplified”. Randomness ampli- 
fication has recently triggered a flurry of research activity, culminating in 
a striking result: anything that is not completely deterministic can be made 
completely random”’”®. This means, as we explain below, that as long as 
some of our choices are random and beyond control of the powers that 
be, we can keep our secrets secret. 


The power of free choice 


If there is one encryption method that comes close to a perfect cipher, it 
is the one-time pad. As we have already explained, its security critically 


1Mathematical Institute, University of Oxford, Oxford OX2 9GG, UK. Centre for Quantum Technologies, National University of Singapore, 117543 Singapore. “Institute for Theoretical Physics, ETH Zurich, 


8093 Zurich, Switzerland. 


27 MARCH 2014 | VOL 507 | NATURE | 443 


©2014 Macmillan Publishers Limited. All rights reserved 


| RESEARCH | PERSPECTIVE 


relies on the randomness and secrecy of the cryptographic key. There is a 
snag, however, known as the ‘key distribution problem’. Each key bit can 
be used only once, to encrypt one single message bit. To maintain their 
private communication, Alice and Bob must find a way to generate and 
distribute fresh key bits continuously. But how? 

Let us put all the practicalities aside, just for a moment, and dream 
about something that would solve the key distribution problem. For 
example, imagine that Alice and Bob were given two magically linked 
coins, which always come out the same side up—either two heads or two 
tails—with equal probabilities. Alice and Bob can then toss such coins 
at their respective locations, writing ‘0’ for heads and ‘I’ for tails. The 
resulting binary strings will be random and identical, but will they be 
secret? Not necessarily. Technologically superior Eve could have manu- 
factured an additional coin, magically linked to the coins held by Alice and 
Bob. The three coins always tally and Eve knows all the bits in the string. 

Clearly, to achieve secrecy we must let Alice and Bob do something 
that is beyond Eve’s control. For example, Alice and Bob may be given a 
choice between two different coins; Alice can toss either coin A, or coin 
A, and Bob, either B, or B>. For each toss they must choose one of the 
two; tossing both A; and A, or both B, and B, is forbidden. Suppose, 
again, that the coins are magically linked; Alice and Bob’s coins always 
come out the same, except when they toss A, and B>, which always come 
out opposite. The magic can be succinctly summarized by the following 
four conditions*””® (Fig. 1): 


A, =B,, B, 


Az, Az 


By, By AA, (1) 


These conditions are clearly contradictory; it is impossible to assign 
values to .A;, A>, B, and By; so that all the four conditions are satisfied. 
But remember, Alice and Bob can toss only one coin each, and thus 
they can test only one of the four conditions in equation (1) at a time. 
Unperformed tosses do not have outcomes, and, hence, there is no 
contradiction here. 

What if, say, Alice could break the rule and toss both of her coins, A, 
and Az, in one go? It turns out that she would deprive Bob of his free 
choice. Suppose that Alice tossed first (correlations are not affected by 
the chronological order of the tosses) and that her outcomes are such 
that A, = A. Then Bob has no choice but to toss B,, because this is the 
only choice compatible with the conditions in equation (1). Similarly, if 
A, # Ag, the only choice left to Bob is to toss B>. This simple argument 
implies that the magic coins cannot be cloned. Having a clone, Z, of, say, 
A, (such that Z = A), and being able to toss it together with A, would 
lead to the same contradictions as tossing both A; and A». The existence 
of Z deprives Bob of his free choice. The conclusion is that if Alice and 
Bob have free choice then the magic correlations must be monogamous, 
that is, nothing else can be correlated to their coins. This turns the tables 
on Eve. Neither she nor anyone else can manufacture a coin that will 
always tally with any of the coins held by Alice or Bob. All ingredients for 
secure key distribution are now in place. 


Key distribution 


To establish a cryptographic key, Alice and Bob toss their magic coins. For 
each toss, Alice and Bob choose randomly, and independently of each 


Alice 4 CS 

Figure 1 | Magic correlations. Alice and Bob choose and toss one coin each. 
Their choices are free, random and independent of each other, and the coins 
always come out the same way up, except when they toss A, and B2, which 
always come out the opposite way up (represented by the red wiggly lines). Such 
correlations cannot be shared with a third party; for example, nobody can 


manufacture a coin that will always tally with any of the coins held by Alice 
or Bob. 
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other, which particular coin will be tossed: Alice is choosing between A, 
and A, and Bob, between B, and By». After the toss, they announce pub- 
licly the coins they selected, but not the outcomes they registered. The 
outcomes are secret, because the coins cannot be cloned, and identical, 
because the coins are magically linked (except when A, and B; are tossed, 
in which case either Bob or Alice must flip his or her bit). The net result 
is that Alice and Bob share one secret bit. To establish a longer key, they 
simply repeat this procedure as many times as required. 

We note that Alice and Bob do not need to make any assumptions about 
the provenance of the coins; as long as the coins comply with the conditions 
in equation (1), they are as good as it gets and could have been manufactured 
by anyone, adversaries included. But this compliance has to be checked. 
Alice and Bob can do it, for example, by revealing the outcomes of some 
randomly chosen tosses and checking if they agree with equation (1). 
Such publicly disclosed tosses are then discarded and the key is composed 
from the remaining tosses, outcomes of which have never been revealed 
in public. If Alice and Bob notice a deviation from the magic correlations, 
they abort the key distribution and try again with another set of coins. 

Here we have tacitly assumed that Alice and Bob can communicate in 
public, but in such a way that nobody can alter their messages; for example, 
they might use a radio broadcast or an advert in a newspaper, or some 
other way that prevents impersonations. This communication is passively 
monitored by Eve and is the only information she gathers during the key 
distribution, because the coins are tossed in well-isolated locations that 
prevent any leaks of the registered outcomes. Given this, the secrecy of the 
key is based solely on the monogamy of the magic correlations and on one 
innocuous but essential assumption: both Alice and Bob can freely choose 
which coins to toss. 

It seems that we have already achieved our goal. There is only one 
little problem with our, otherwise impeccable, solution of the key distri- 
bution problem, which is that the magic correlations do not exist. That 
is, we do not know of any physical process that can generate them. But all 
is not lost, because there are physically admissible correlations that are 
‘magical’ enough for our purposes. Welcome to the quantum world! 


The quantum of solace 


Quantum theory is believed to govern all objects, large and small, but its 
consequences are most conspicuous in microscopic systems such as indi- 
vidual atoms or photons. Take, for example, polarized photons. Millions 
of identically polarized photons form the familiar polarized light, but at 
the quantum level polarization is an intrinsic property of each photon, 
corresponding to its spin. Although the polarization of a single photon 
can be measured along any direction, the outcome of the measurement 
has only two values, indicating whether the polarization is parallel or 
orthogonal to the measurement direction. For our purposes, we will label 
these outcomes 0 and 1. 

A number of quantum optical techniques can be employed to gen- 
erate pairs of polarization-entangled photons. Such photons respond to 
measurements, carried out on each of them separately, in a very coordi- 
nated manner. Suppose that Alice and Bob measure the polarizations of 
their respective photons along different directions, « and f. It turns out 
that, although the values 0 and 1 are equally likely to appear, Alice and 
Bob’s outcomes tally with the probability 


cos*(a— f) (2) 


This is just about everything you need to know about quantum physics 
for now. 

Let us now replace the coin tosses by appropriately chosen polarization 
measurements: instead of tossing coin Aj, Alice simply measures her photon 
along «, = 0; and instead of tossing Ay, she measures the photon along 
02 = 21/8. Similarly, Bob replaces his coin tosses B, and B, by measure- 
ments along directions f, = n/8 and 2 = 31/8, respectively. The resulting 
joint probabilities of all possible outcomes, obtained using equation (2) 
and the specified polarization angles, are shown in Table 1. 

From a more general perspective, for any value of ¢, which can be con- 
sidered the probability of deviation from the magic correlations, the 
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Table 1 | Approximating magic correlations 


A, =0 A,=1 A2=0 A2=1 
B,=0 l—-e & l-e & 
2 2 2 

B,=1 & 1l—< & l—-e« 
2 2 2 2 
Bo =0 & l-e« l-« & 
2 2 2 2 

Bo=1 l—s & & l—-« 
2 2 2 2 


Joint probabilities of binary outcomes given the choices of A; and B; (i,j = 1,2). The parameter « takes 
the value O for the magic correlations (see equation (1)). The lowest physically admissible value, 

= sin°(x/8) ~ 0.146, can be obtained by measuring polarizations of appropriately entangled photons 
at some specific angles, for example 0, 2/8, 2/8 and 3n/8, corresponding to Aj, Bi, Az and Bo, 
respectively. 


table describes ‘non-signalling’ correlations: Alice, by choosing between 
A, and A2, cannot communicate any information to Bob, and vice versa 
Bob choosing between B, and B2 cannot send any information to Alice. 
Neither of them can see through the statistics of the outcomes what the 
other one is doing. Correlations with ¢ = 1/4 are called ‘classical’, because 
they admit pre-assigned values of A,, Az, B, and By. This is no longer the 
case when ¢ < 1/4, because any pre-assignment is bound to violate at least 
one of the four conditions in equation (1). Surprisingly, as we have just 
seen, there are physically admissible correlations for which ¢ can reach 
sin?(1/8) ~ 0.146, which is the lowest value that can be achieved with quan- 
tum correlations*'. Even though perfect magic correlations, with ¢ = 0, 
do not exist, there is still some magic left in quantum correlations, and it 
can be exploited. 


Less reality, more security 


The impossibility of assigning numerical values to certain physical quan- 
tities, for example the different polarizations of a photon, has been baffling 
physicists for almost a century”. After all, most of us grew up holding 
it self-evident that there is an objective reality in which physical objects 
have properties that can be quantified and whose values exist regardless 
of whether we measure them or not. Shocking as it may be, our world is 
not of this kind. Statistical inequalities, such as ¢ = 1/4, derived on the 
assumption that the values of unmeasured physical quantities do exist 
and commonly referred to as Bell’s inequalities*’, have been violated in a 
number of painstaking experiments**““*. We shall not dwell on the philo- 
sophical implications of this experimental fact (volumes have been written 
on the subject), but simply point out that it should be embraced by all 
those who worry about secrecy because what does not exist cannot be eaves- 
dropped, and so it is much easier to keep secrets in a non-classical world. 

Indeed, given the correlations parameterized by ¢, it can be shown that 
the probability of Eve guessing correctly any particular outcome cannot 
exceed (1 + 4¢)/2 (Box 1). Eve may know something about the outcomes 
(which is not good) but Alice and Bob, after running a statistical test and 
estimating ¢, know how much she may know (which is good). If ¢ is low 
enough, this allows them to distil an almost perfect key from the out- 
comes, using a technique known as ‘privacy amplification’.°“° The basic 
idea behind privacy amplification is quite simple. Imagine that you have 
two bits and that you know your adversary knows at most one of them, 
but that you do not know which one. Add the two bits together (mod- 
ulo 2); the resulting bit will be secret. Needless to say, given more bits, 
there are more sophisticated ways of achieving secrecy, to mention only 
two-universal hash functions” or Trevisan’s extractor’. 

In summary, whenever Alice and Bob are given any devices that gen- 
erate correlated outcomes, they can run the key distribution protocol 
supplemented by a statistical ‘honesty test’ to estimate e. If this value is 
small enough, say ¢ = 0.15, the end result, after privacy amplification, is 
a perfect cryptographic key. We obtain trusted privacy from untrusted 
devices, but what constitutes a device? We need to sort out one more 
thing before we can celebrate the arrival of the ultimate cipher. Should 
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BOX | 
Eavesdropping quantified 


Suppose that Eve wants to manufacture a device that outputs binary 
values, Z, designed to tally with, say, A;. Regardless of her technological 
prowess, Eve has limited chances to succeed. For any two outcomes, A; 
and B;, the probabilities that they are equal to Z, that is, Pr(Z = A)) and 
Pr(Z = B)), cannot differ by more than Pr(A; # 8). This implies a 
sequence of inequalities: 

Pr(Z=A1)— Pr(Z=B;)< Pr(Ai B81) 

Pr(Z=B,)— Pr(Z=Az) < Pr(Bi #A2) 

Pr(Z=Az)— Pr(Z=Bo) < Pr(Az 4Bo) 

Pr(Z=Bo)— Pr(Z4Aj) < Pr(B2 =Ai) 
Adding these inequalities together and taking into account that 
Pr(Z#A}) = 1-Pr(Z =A)) gives 


PriZ=Ar)< 3(1 +h) 


where the quantity /2 = Pr(A; # Bi) + Pr(By #Az2) + Pr(As # Bo) + 
Pr(Bz = Aj) is the sum of the probabilities that any of the conditions in 
equation (1) is violated. The derivation presented here works for any A; 
and Bj, and, it is worth stressing, does not involve quantum theory. 

Although the values Aj, Az, B; and Bo do not coexist, all the 
probabilities used here involve only pairs of values, A; and B,, which can 
be measured simultaneously. They can be determined from the 
statistics of the experimental data. For the polarization measurements 
described in the text, we would obtain I> = 4e where ¢ = sin?(n/8) ~ 
0.146. The bound thus asserts that Pr(Z = A) = 0.793; that is, Eve’s 
value, Z, will deviate from A, in more than 20% of the cases. 

The notion of magic correlations can be extended to cases where 
Alice and Bob choose between n = 2 different measurements®”®, with 
the conditions in equation (1) replaced by 


Ai =81, Bi=Apz, ..., An=Bn, Bn ¥A1 (3) 


To approximate such correlations, Alice and Bob may use entangled 
photons and measure polarizations A; and B;, specified by angles «; 
and f;. These angles are chosen to be even and odd multiples of n/4n, 
respectively, so that the adjacent values of o; and f; are x/4n radians 
apart. Then, according to equation (2), each of the conditions in 
equation (3) is satisfied, except with an error probability of 

én = sin*(n/4n) < 1/n”. Itcan then be shown, by the same arguments as 
forthen = 2 case, that any attempt by Eve to compute a prediction, Z, for 
the outcome of, say, Ai, can succeed with probability at most (1 + /,)/2, 
where |, = Pr(A; By) + Pr(B; Az) + --» + Pr(A, ¥B,) + Pr(B, = Ai). 
For any classical correlations, /, = 1. In contrast, quantum theory 
admits correlations such that /, = 2ne, < 2/n. Consequently, in the 
limit of large n, the probability of Eve guessing the value of A; correctly 
becomes 1/2; thatis, A, is uniformly random and independent of any 
information held by Eve. This observation is not only relevant for key 
distribution??, but has been crucial for randomness amplification?®. 


Alice and Bob trust the ultimate measuring and controlling devices; that 
is, should they trust themselves? 


Should we trust ourselves? 


We can hardly get more paranoid than that. Can we make free choices or 
are we held to the ransom of a greater force? In other words, what if we 
are manipulated? 

We have already stressed the power of free choice. Decisions such as 
which coin to toss and which polarization to measure must be made 
freely (randomly) and independently. If referring to the experimenter’s 
‘free will’ sounds too esoteric, then think about the random number gen- 
erators that in practical implementations make such choices. Where is 
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their randomness coming from? What if these random number generators 
are of dubious provenance, possibly manufactured by the same person 
who offered the key distribution kit? It is evident that without random- 
ness there is no privacy: if everything is pre-determined, and all possible 
choices we make (with the help of tweaked random number generators or 
otherwise) are predictable or pre-programmed by our adversaries, then 
there is nothing that we can build our privacy on. Or is there? 

There is if the manipulation is not complete and there is a little bit of 
freedom left. If someone we trust tells us that such and such a fraction of 
the choices made by our random number generators cannot be determined 
by the adversary, then privacy is still possible because local randomness can 
be amplified’®. Randomness amplification can itself be done with device- 
independent protocols, and it works even if the fraction of initial random- 
ness is arbitrarily small or the devices are noisy”’”*. 

It all looks bizarre and too good to be true. Perfect privacy, secure against 
powerful adversaries who provide us with cryptographic tools and who 
may even manipulate us? Is such a thing possible? Yes, it is, but ‘the devil 
is in the detail’ and we need to look into some practicalities. 


Practicalities 


Quantum key distribution, in which security is tested by the degree of 
violation of Bell’s inequalities, was proposed some time ago’ and was fol- 
lowed shortly by a proof-of-principle experiment at what used to be called 
the Defence Research Agency (now Qinetiq) in Malvern, UK”’. However, 
the device-independent character of this protocol has not been recog- 
nized until recently’’. Moreover, proving the security of such a scheme in 
the presence of noise has not been easy. It has taken over a decade to agree 
ona useful definition of secrecy, even for trusted devices, and to conclude 
a long sequence of steadily improved security results” ** that eventually 
took into account all the quantum resources that Eve can muster™*. Deal- 
ing with untrusted devices is even more tricky and keeps many of our col- 
leagues busy”?*”. 

Although all security proofs infer secrecy from the monogamy of the 
correlations, a major challenge is to make these arguments quantitative 
and robust to noise and imperfections, and applicable to keys of finite 
size**°. There are other issues as well. For example, here we have taken 
for granted that Alice and Bob can estimate the parameter « from a suf- 
ficiently large sample of their registered data. In the quantum domain, a 
statement of that kind requires a quantum version of what is known in 
classical statistics as de Finetti’s theorem™”. It guarantees that, for instance, 
pairs of photons can be treated as individual objects with individual prop- 
erties and without any hidden correlations to other pairs. These, and 
many other results, addressed a number of subtleties and, finally, twenty 
years after its inception, the original entanglement-based key distribu- 
tion protocol’ has been shown to offer security even if the devices are not 
fully trusted and are exposed to noise'*®. This is assuming that quantum 
theory is all that there is, and that Eve is bound by the laws of quantum 
physics. However, if Alice and Bob are paranoid enough to give Eve some 
‘post-quantum’ powers (technologies more powerful than quantum tech- 
nologies which may rely on as-yet undiscovered physical phenomena that 
are not described by quantum physics), they can still resort to less efficient 
protocols that do not rely on quantum theory”. We should stress, how- 
ever, that device-independent protocols and their security proofs have 
not yet reached the level of sophistication that is now common for the device- 
dependent scenario. In particular, more work is needed to improve the effi- 
ciency of the key distribution protocols or to identify conditions under which 
untrusted devices may be reused in multiple rounds of such protocols. 

Given that violation of Bell’s inequality is an experimental fact, what is 
it that prevents us from running the experiments that violated Bell’s 
inequality again, but this time under the label of the device-independent 
key distribution? Convincing as they are, these experiments still leave 
some loopholes. For example, it is in principle possible that the photons 
detected in the experiments did not represent a fair sample of all photons 
emitted by the source (the “detection loophole’) or that the various parts 
and components of the experiment were causally connected (the ‘local- 
ity loophole’). Some of these concerns were addressed in more recent 
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experiments**“*, but, a single experiment that closes all the loopholes at 
once, demonstrating the ultimate violation of Bell’s inequality, is still lacking. 

This is not so disturbing for physicists, because nature would have to be 
very malicious if it were to cheat us selectively—on locality in some experi- 
ments and in exploring detection loopholes in some other. In contrast, 
there is nothing to prevent an eavesdropper being malicious. In this adver- 
sarial setting, a proper experimental demonstration of device-independent 
cryptography requires a proper violation of Bell’s inequalities. This is 
particularly true for the detection loophole. Imagine, for example, that 
Eve pre-programmed the devices assuming in advance a sequence of 
settings that Alice and Bob may choose for their measurements. When- 
ever her guess is correct, the devices will respond with pre-programmed 
results, and when it is not, one of the devices will simulate failure to respond. 
If Alice and Bob naively discard all the instances in which at least one of 
the devices failed to deliver a result, then they can be easily fooled by Eve. 
Thus, we do need the loophole-free violation of Bell’s inequalities. 

Closing the detection loop-hole is very challenging, because almost any 
optical component adds losses and imperfections to the key distribution 
set-up, but it is within the reach of today’s technology, especially with the 
rapid progress in photodetection techniques. If distance is not an issue, 
then we can achieve near-perfect detection efficiency using entangled ions 
rather than photons”, and this has been used to generate the first device- 
independent certified randomness*’”. Short of full device independence, 
we can also entertain intermediate scenarios, where some parts of the devices 
are trusted and some are not. Indeed, proposals that address issues such 
as untrusted detectors” offer significant improvements over the exist- 
ing quantum key distribution schemes**®° and move secure communi- 
cation in interesting new directions. 

Experimental device-independent cryptography is far from easy, but 
technological progress so far has encouraged optimism. The days we stop 
worrying about untrustworthy or incompetent providers of cryptographic 
services may be not that far away. 


Conclusion 


Over the past decade or so, quantum cryptography has come of age, but 
the field is still an amazingly fertile source of inspiration for fundamental 
research. The search for the ultimate physical limits of privacy is still very 
much a work in progress, but we know that privacy is possible under sur- 
prisingly weak assumptions. Monogamous correlations, of whatever ori- 
gin, and an arbitrarily small amount of free will are sufficient to conceal 
whatever we like. Free will is our most valuable asset. Come to think about 
it, without free will, there is no point in concealing anything anyway. 
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Human neurons are functional over an entire lifetime, yet the mechanisms that preserve function and protect against 
neurodegeneration during ageing are unknown. Here we show that induction of the repressor element 1-silencing tran- 
scription factor (REST; also known as neuron-restrictive silencer factor, NRSF) is a universal feature of normal ageing in 
human cortical and hippocampal neurons. REST is lost, however, in mild cognitive impairment and Alzheimer’s disease. 
Chromatin immunoprecipitation with deep sequencing and expression analysis show that REST represses genes that 
promote cell death and Alzheimer’s disease pathology, and induces the expression of stress response genes. Moreover, 
REST potently protects neurons from oxidative stress and amyloid B-protein toxicity, and conditional deletion of REST 
in the mouse brain leads to age-related neurodegeneration. A functional orthologue of REST, Caenorhabditis elegans 
SPR-4, also protects against oxidative stress and amyloid B-protein toxicity. During normal ageing, REST is induced in 
part by cell non-autonomous Wnt signalling. However, in Alzheimer’s disease, frontotemporal dementia and dementia 
with Lewy bodies, REST is lost from the nucleus and appears in autophagosomes together with pathological misfolded 
proteins. Finally, REST levels during ageing are closely correlated with cognitive preservation and longevity. Thus, the 


activation state of REST may distinguish neuroprotection from neurodegeneration in the ageing brain. 


The preservation of cognitive function during ageing has emerged as 
one of the major medical challenges of the 21st century. A fundamen- 
tal question is why some individuals age with their cognitive function 
relatively intact whereas others decline and develop Alzheimer’s dis- 
ease (AD). Early studies suggested that neuronal loss was an integral 
feature of the ageing brain. With the advent of stereological neuronal 
quantification, however, it became clear that neuronal cell number is 
largely preserved in the neocortex and hippocampus of the ageing human 
brain, declining only in the setting of neurodegenerative disease’. 
Robust stress-response mechanisms must have evolved, therefore, to 
preserve neurons and cognitive function across an entire lifespan*”. 

REST is a repressor of neuronal genes during embryonic develop- 
ment that is downregulated once terminal neuronal differentiation has 
occurred®*. Here we show that REST is induced in the ageing human 
brain and regulates a network of genes that mediate cell death, stress 
resistance and AD pathology. This gene network becomes dysregulated 
at early stages of AD when REST is lost from the nucleus. Conditional 
REST knockout mice and C. elegans models suggest that REST protects 
neurons from age-related toxic insults. In ageing humans, elevated REST 
levels are associated with preservation of cognitive function and increased 
longevity, even in the presence of AD pathology. Hence, REST regu- 
lates a neuroprotective stress response that may be central to cognitive 
preservation during ageing. 


REST is induced in the ageing brain and declines in AD 


Transcriptional profiling has demonstrated significant changes in the 
expression of neuronal genes in the prefrontal cortex of ageing humans””®. 
Analysis of this data set using the Ingenuity Systems IPA platform indi- 
cates that the transcription factor most strongly predicted to be activated 
in the ageing brain is REST (P= 9 X 10 '°). Moreover, the 21-base-pair 
canonical RE] recognition motif for REST is highly enriched in the 
age-downregulated gene set (P = 3 X 10°’) (Fig. 1a). 


To explore the role of REST in the ageing brain, we measured REST 
levels in extracts of prefrontal cortex (PFC) from young adult (20-35 
years) and aged (73-106 years) individuals without AD. REST expres- 
sion was significantly increased in the ageing human PFC at both the 
messenger RNA and protein levels, as determined by quantitative PCR 
with reverse transcription (qRT-PCR) and western blotting (Fig. 1b, c). 
Full-length REST was markedly increased; the truncated splice variant 
REST4 was a minor component, comprising 0.1-0.5% of REST mRNA. 
Immunofluorescence microscopy using three different antibodies against 
the amino- or carboxy-terminal domains of REST showed a striking 
induction of REST in the nucleus of ageing neurons in the PFC and 
hippocampus (Fig. 1d, e and Extended Data Fig. 1). Much lower levels 
of REST were detected in microglial cells and astrocytes (data not shown). 
REST antibody specificity was indicated by ablation of immunoreactivity 
after antibody preabsorption with a REST blocking peptide, absence of 
immunoreactivity with matched nonspecific IgG, and loss of immu- 
noreactivity after shaRNA-mediated REST knockdown in neural SH- 
SY5Y cells (Extended Data Fig. 1b, c). 

We then investigated whether induction of REST in ageing neurons 
leads to increased REST-REI site binding. To assess REST targeting 
specifically in neurons, we isolated neuronal nuclei from the PFC by 
fluorescence-activated cell sorting (FACS) of NeuN-positive nuclei° 
(Methods). Chromatin immunoprecipitation (ChIP)-PCR analysis 
showed a marked induction of REST binding to canonical REI motifs 
in REST target genes in the aged PFC (Fig. 1f). These results indicate 
that REST expression and function is increased in ageing neurons. 

We next examined REST in ageing individuals with mild cognitive 
impairment (MCI) or AD. REST was almost absent from the nucleus 
of cortical and hippocampal neurons in AD (Fig. 1d and Extended Data 
Fig. la, d). Punctate cytoplasmic REST immunoreactivity was detected 
in some AD neurons, but was not detectable in other neurons (Fig. 1d). 
Loss of nuclear REST was confirmed by quantitative immunocytochemical 
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Figure 1 | Induction of REST in the ageing human prefrontal cortex. 

a, Hierarchical cluster analysis of predicted REST targets, based on the presence 
of the canonical RE1 motif”’, shows relatively high expression in young adults 
(red) and lower expression (blue) in the ageing population. Each lane 
represents an individual prefrontal cortical brain sample. b, RT-PCR shows 
age-dependent induction of REST mRNA in the PFC. Values represent the 
mean ~ s.d., n = 3. Shown are P values indicating significance for the mean 
of aged (71-95 years) versus young (24-29 years) cases for each primer. 

c, Age-dependent increase in total REST protein level in the PFC of controls 
but not AD patients. REST levels were also measured in isolated nuclear 
(Nuc REST; lower right panel, upper blot) and cytoplasmic (Cyt REST; lower 


analysis of 77 cases of sporadic AD compared with 72 age-matched 
cognitively intact controls, and confirmed by FACS analysis of neuronal 
nuclei in a subset of cases (Fig. le). In 23 out of 24 cases of amnestic 
MCI, nuclear REST levels were reduced relative to the mean of aged 
controls (Fig. le). A survey of brain regions showed significantly reduced 
nuclear REST levels in affected neuronal populations in AD, including 
prefrontal cortical neurons, and neurons of the CA1, CA3 and CA4 sub- 
fields of the hippocampus (Extended Data Fig. 1a, d). In contrast, nuclear 
REST was not reduced in neurons of the dentate gyrus and cerebel- 
lum, which are relatively unaffected in AD (Extended Data Fig. 1a, d). 
Reduced REST levels in AD were confirmed by western blotting of pre- 
frontal cortical grey matter samples, as well as isolated cortical nuclei 
(Fig. 1c, upper and far right panels, respectively). Furthermore, ChIP- 
PCR analysis of neuronal nuclei showed significantly reduced or absent 
REST-REI site binding in AD relative to normal ageing controls (Figs 1f 
and 2a). Thus, REST is induced during normal brain ageing, but is 
markedly reduced in AD in vulnerable neuronal populations. 


REST represses genes involved in cell death and AD 

To obtain greater insight into the functional consequences of REST 
induction in the ageing brain, we explored REST targets by chromatin 
immunoprecipitation and deep sequencing (ChIP-seq) in the neural 
cell line SH-SY5Y. REST binding to a subset of the identified target 
genes was confirmed by ChIP-qPCR (Extended Data Fig. 2). REST ChIP- 
seq sites were significantly enriched among age-regulated genes in human 
PFC (P = 1.65 X 10”), and were predominantly downregulated from 
the third to the tenth decades of life (Extended Data Fig. 2a). Pathway 
analysis of the ChIP-seq database showed a highly significant enrich- 
ment for genes in cell death pathways and genes that are associated 
with AD/dementia (Extended Data Fig. 2b, c). Cell death genes include 


right panel, lower blot) fractions. Each lane represents an individual case. 
Young, n = 12; Aged, n = 15; AD, n = 10. d, Confocal immunofluorescence 
labelling for REST (green), the neuronal marker MAP2 (red) and DNA (DAPI, 
blue) in the PFC of young adult, aged and AD cases. Scale bar, 25 jim. 

e, Quantitative analysis of nuclear REST levels by in situ imaging (left panel: 
Young, n = 11; Aged, n = 77; AD, n = 72; MCI = 11) or FACS analysis of 
isolated PFC neuronal nuclei (right panel: Young, n = 11; Aged, n = 22; AD, 
n= 11; MCIn = 12). Forcande, values are expressed as fold change relative to 
the young adult group, and represent the mean + s.e.m. *P < 0.05, **P < 0.01, 
***P < 0.001 by Student’s unpaired t-test. f, ChIP analysis shows induction 
of REST-RE] site binding in normal ageing PFC neurons but not in AD. 


p38 MAP kinase (MAPK11), FAS, FADD, TRADD, BAX, BID, DAXX, 
PUMA (also known as BBC3), the mitochondrial permeability transi- 
tion pore ANT1 (also known as SLC25A4) and cytochrome C. REST tar- 
gets related to the pathology of AD include the genes coding for gamma 
secretase complex members presenilin 2 and presenilin enhancer 2 impli- 
cated in amyloid B-protein (AB) generation, and CDK5R1 (also known 
as p35), CDK5R2 (also known as p39), p38 MAP kinase (also known as 
MAPK11, 12 and 14) and 14-3-3 zeta (also known as YWHAZ) impli- 
cated in tau (also known as MAPT) phosphorylation. REST repressed 
the expression of these genes in SH-SY5Y cells (Extended Data Fig. 3a, b), 
and strongly inhibited tau phosphorylation at the AT8 and PHF! epi- 
topes associated with AD (Extended Data Fig. 3c). 

We next examined the regulation of these REST target genes in the 
normal ageing brain and AD. To distinguish effects at different stages 
of cognitive decline, we stratified the AD population for least severe 
(AD1; Mini Mental State Examination (MMSE) score > 18) and most 
severe (AD2; MMSE score < 8). REST targets related to cell death and 
AD pathology showed reduced REST binding and elevated mRNA 
expression at both stages of AD (Fig. 2a, b). REST target genes related 
to neurotransmission, however, showed elevated mRNA expression 
in early stage AD1, but reduced expression in later stage AD2 (Fig. 2b). 
Regression analysis of protein levels showed that nuclear REST is inversely 
correlated with levels of pro-apoptotic (BAX and DAXX) and AD-related 
(presenilin 2) proteins in normal ageing and AD neurons (Fig. 2c, d). 
This was also observed for REST targets associated with neurotrans- 
mission (SST and CALB1) during normal ageing and early stage AD1, 
but not in late stage AD2 (Fig. 2c). The activation-associated histone 
modification H3K9ac, which is typically reduced by REST, was glob- 
ally downregulated in normal ageing PFC neurons (Fig. 2e) and was 
inversely correlated with REST levels (Fig. 2c, d). In contrast, neuronal 
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Figure 2 | Regulation of REST target genes in 
ageing and AD. a, ChIP-qPCR analysis of REST 
binding to target genes in isolated PFC neuronal 
nuclei. Shown are young adult (Young), normal 
aged (Aged) and AD cases, which were stratified as 
AD1 (MMSE score > 18) and AD2 (MMSE 

score < 8). Also shown are control ChIP assays 
with non-specific IgG, and a REST ChIP with PCR 
primers directed 10 kb 3’ to the SNAP25 RE] site 
(SNAP25-10K down). Values represent fold change 
relative to the mean young adult value, and 
represent the mean + s.e.m., n = 7. *P < 0.05 by 
the Mann-Whitney test. b, mRNA expression of 
REST target genes determined by qRT-PCR of 
PFC. Shown are REST target genes related to cell 
death pathways and AD pathology (left), and 
neurotransmission (right). Values are normalized 
to the mean young adult expression level (100%) 
and represent the mean + s.e.m. Young, n = 9; 
Aged, n = 10; AD1, n = 10; AD2, n = 10. Asterisks 
above the ‘Aged’ bars indicate significance relative 
to young adults; brackets indicate significance 
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of nuclear REST versus levels of protein targets or 
histone H3K9ac in double-labelled PFC neurons. 
d, Representative images showing inverse 
relationships between nuclear REST and H3K9ac, 


R=-0.69 


R=-0.18 


AD2 


PS2, DAXX and BAX in AD1 neurons. Scale bar, 
20 pum. e, Global epigenetic regulation in ageing 
and AD. Histone modification H3K9ac was 
determined in isolated PFC neuronal nuclei. 


H3K9ac increased significantly in early AD (Fig. 2e). Thus, loss of 
nuclear REST in AD is associated with epigenetic derepression of poten- 
tially pathogenic genes. 


REST is neuroprotective 


Because REST represses pro-apoptotic genes, we explored a role in 
neuroprotection by assessing vulnerability to oxidative stress and olig- 
omeric Af, stressors associated with brain ageing’ and AD”. Primary 
cultures of mouse E16 cortical neurons were established from Nestin- 
Cre:REST*™ (REST cKO) and littermate control embryos. After 1-2 weeks 
in culture, neuronal differentiation was comparable in REST-deficient 
and control cultures. Upon treatment with hydrogen peroxide or oligo- 
meric AB42, REST-deficient neurons showed markedly increased degen- 
eration and cell death relative to controls (Fig. 3a and Extended Data 
Fig. 4a). REST-deficient neurons were rescued by lentiviral transduc- 
tion of wild-type REST, leading to expression levels 2-2.5-fold that of 
wild type (Fig. 3a, Extended Data Fig. 4a). In contrast, a REST mutant 
that is deficient in nuclear translocation (AREST)!*"* was unable to rescue 
(Fig. 3a, b). REST-deficient neurons showed increased expression of 
pro-apoptotic genes in response to oxidative stress and oligomeric AB 
(Extended Data Fig. 4b). Thus, endogenous REST is neuroprotective. 

REST-deficient mice showed a progressive age-related neurodegen- 
erative phenotype. At 1 month of age, neuronal numbers in the cortex 
and hippocampus of REST cKO mice were not significantly different from 
controls. Neuronal degeneration and apoptosis appeared by 8 months 
of age, as indicated by positive neuronal labelling for cleaved caspase-3 
and TdT-mediated dUTP nick end labelling (TUNEL; Fig. 3c, d). This 
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was accompanied by significant neuronal loss in the cerebral cortex 
and hippocampus, and pronounced gliosis (Fig. 3c, d). These results 
indicate that REST is essential for maintaining neuronal viability in the 
ageing brain. 

To confirm the findings on REST-deficient neurons, we assessed 
vulnerability to oxidative stress after REST overexpression or short hair- 
pin RNA-mediated knockdown in neural SH-SY5Y cells. Cell death 
induced by hydrogen peroxide was significantly reduced when REST 
was overexpressed by 3- to 20-fold (Extended Data Fig. 4c). Higher 
levels of REST overexpression increased cell death. REST knockdown 
by lentiviral shRNA transduction significantly increased HO-induced 
apoptosis in both SH-SY5Y cells and cultured human cortical neurons, 
which was prevented by shRNA-resistant mouse REST (Extended Data 
Fig. 4c, d). Furthermore, shRNA-mediated REST knockdown increased 
levels of reactive oxygen species (ROS), which was prevented by the 
antioxidant N-acetyl-cysteine (NAC) (Extended Data Fig. 4e, f). More- 
over, REST knockdown increased oxidative DNA damage, which was 
reduced by REST overexpression (Extended Data Fig. 4g). To examine 
the relationship of REST expression to oxidative DNA damage in the 
human brain, we double-labelled for REST and the oxidative DNA dam- 
age marker 8-oxoguanine (8-oxoG). Linear regression analysis showed 
a significant inverse relationship between nuclear REST and 8-oxoG 
levels in normal ageing neurons and in AD (Extended Data Fig. 4h). 
Thus, REST expression during ageing may reduce oxidative damage. 

The nematode Caenarhobditis elegans has served as a model system 
of ageing and stress resistance*. Two multi-zinc finger transcription 
factors in C. elegans structurally resemble mammalian REST (spr-3 and 
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Figure 3 | REST is neuroprotective. a, REST-deficient cortical neurons 
cultured from REST cKO mouse embryos show increased vulnerability to H,O, 
and oligomeric AB42 toxicity. Cell viability is expressed as percentage of the 
value in untreated cultures. Values represent the mean = s.e.m., n = 6-8. 

*P < 0.05, **P < 0.01 by Student’s unpaired t-test. b, Reduced nuclear 
translocation of the AREST mutant. c, REST cKO mice show age-related 
neurodegeneration. Shown are sections of the hippocampal CA1 subfield 
from 8-month-old REST cKO mice labelled for TUNEL, cleaved caspase 3 
(cl-CASP3) and GFAP, or stained with haematoxylin and eosin (H&E). 

d, Neurodegeneration in 8-months but not 1-month-old REST cKO mice in the 
cortex, and in hippocampal CA1 and dentate gyrus (DG) regions. Values 
represent the mean = s.e.m., n = 6 mice; **P < 0.001 relative to control by 
Student’s unpaired t-test. Scale bars, 20 um. 


spr-4)'*"°, and another gene is closely homologous to CoREST (spr-1)’°. 
To test the role of these genes in oxidative stress resistance, we placed 
adult worms on nematode growth medium supplemented with para- 
quat, a superoxide generator. C. elegans lines with mutations in spr-1, 
spr-3 and spr-4 all showed significantly reduced survival relative to 
wild-type controls (Fig. 4a and Extended Data Fig. 5a). The spr-4(by105) 
mutant showed the greatest vulnerability. Furthermore, depletion of 
spr-4 by RNA interference significantly reduced survival in wild-type 
worms after treatment with paraquat (Extended Data Fig. 5b). SPR-4 
was expressed predominantly in neurons (Extended Data Fig. 5c), and 
was induced by oxidative stress (Extended Data Fig. 5d). 

The increased sensitivity of the spr-4 mutant worms to oxidative 
stress was significantly reduced by transgenic expression of either wild- 
type SPR-4 or human REST (Fig. 4a, b). To ascertain whether SPR-4 
and REST directly modulate ROS levels, worms were labelled with the 
fluorescent ROS indicator 2’,7’-dichlorodihydrofluorescein diacetate 
(DCFDA). spr-4(by105) mutants treated with paraquat showed signif- 
icantly elevated ROS levels relative to wild-type, which was reduced 
by transgenic expression of either wild-type SPR-4 or human REST 
(Fig. 4c, d). To determine whether REST can substitute for SPR-4 as a 
transcriptional repressor, we assessed the expression of the presenilin 
gene hop-1, which is repressed by SPR-4. The expression of hop-1 was 
elevated in adult spr-4 mutants relative to wild-type worms, and was 
repressed by both wild-type SPR-4 and human REST (Extended Data 
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Figure 4 | Caenorhabditis elegans SPR-4 protects against oxidative stress 
and AB toxicity. a, spr-4(by105) worms incubated continuously with paraquat 
(5mM) exhibit increased mortality rescued by wild-type SPR-4 or human 
REST. Shown is a representative experiment replicated three times. 

b, Quantitative analysis of survival in wild-type worms expressing mCherry 
or SPR-4 (WT+mCherry and WT+SPR-4), spr-4(by105) mutants, and 
spr-4(by105) mutants expressing SPR-4 or human REST (spr4(by105)+SPR4 
and spr4(by105)+REST). Shown is the percentage change in mean survival 
relative to wild-type. Values represent the mean + s.d., n = 3 independent 
replicates of at least 30 animals per genotype; **P < 0.01 relative to wild type 
by the log-rank test. c, REST and SPR-4 reduce levels of reactive oxygen species 
(ROS). Shown are representative confocal images of paraquat-treated worms 
labelled with the ROS-sensitive dye DCFDA. WT, wild type. Scale bar, 30 jum. 
d, Quantitation of ROS levels by DCFDA labelling. Horizontal bars indicate 
the median; boxed areas represent the second and third quartiles. *P < 0.05 
relative to wild type by analysis of variance with post-hoc Tukey test; n = 30 
worms. AFU, arbitrary fluorescence units. e, SPR-4 protects against AB 
neurotoxicity. Shown are AB worms (expressing a stably integrated AB1-42 
transgene’) and AB;spr4(by105) worms. Neuronal degeneration does not 
occur in wild-type worms or spr4(by105) mutants in the absence of the AB 
transgene. Values represent the percentage of worms that retain five 
glutamatergic tail neurons at the indicated age (days), and are the mean + s.d., 
n= 3. *P <0.05 by unpaired t-test. 


Fig. 5e). These results indicate that human REST can functionally sub- 
stitute for SPR-4 in C. elegans. 

SPR-4 also modulated the neurotoxicity of AB in a C. elegans line 
that expresses AB42 in glutamatergic neurons and undergoes age- 
dependent neuronal loss’’. When this AB-expressing line was crossed 
with the spr-4(by105) mutant, the resulting line expressed AB42 in an 
spr-4 loss-of-function background and showed significantly acceler- 
ated neurodegeneration (Fig. 4e). Thus, SPR-4 protects against both 
oxidative stress and Af toxicity, consistent with a general role in stress 
resistance. 


Induction of REST by stress and Wnt signalling 


To determine whether REST might be induced by oxidative and other 
forms of stress in the ageing brain, we subjected primary human cor- 
tical neuronal cultures to a variety of stressors. Incubation with redox- 
active iron (Fe *”), hydrogen peroxide, the glutathione synthesis inhibitor 
buthionine sulphoxide (BSO), or AB42-induced REST mRNA and pro- 
tein and increased REST-RE1 site binding (Extended Data Fig. 6a-f). 
To determine whether secreted factors might contribute to REST induc- 
tion, cortical neurons were incubated with H2O, followed by removal 
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of HO, and generation of conditioned medium. Addition of stress- 
related conditioned medium to naive neuronal cultures resulted in a 
marked induction of REST mRNA expression (Extended Data Fig. 6g). 
Thus, a variety of stressors induce REST expression, which is mediated 
at least in part through cell non-autonomous signalling. 

To explore the role of cell non-autonomous signalling in the ageing 
brain, we generated extracts of human PFC from young adult and 
aged samples and assessed their effects on SH-SY5Y cells. REST was 
robustly induced by extracts of aged human cortex, but minimally by 
extracts of young adult cortex (Extended Data Fig. 6h). Furthermore, 
extracts of AD cortex showed reduced REST-inducing activity relative 
to age-matched controls (Extended Data Fig. 6h). Because REST has 
been identified as a target of canonical Wnt-f-catenin signalling’’, we 
examined the role of this cell non-autonomous signalling pathway. 
Extracts of aged cortex and conditioned medium from stressed cells 
treated with H,O2, tunicamycin or wortmannin induced f-catenin in 
parallel with REST (Extended Data Figs 6h and 7a, b). Induction of 
REST and f-catenin was partially inhibited by the Wnt signalling antag- 
onist Dickkopf (Extended Data Fig. 7a, b). REST was also induced by 
addition of purified Wnt-3a and Wnt-7a. Moreover, the GSK3-B inhib- 
itors lithium chloride and Chiron 99021 that activate Wnt signalling 
induced REST, increased REST-RE] site binding, and augmented nuclear 
translocation (Extended Data Fig. 7c-f). To assess Wnt signalling in 
the human brain, we measured nuclear B-catenin levels in the PFC of 
young adult and aged cases. Nuclear B-catenin was significantly ele- 
vated in ageing PFC neurons and colocalized with REST (Extended 
Data Fig. 7g, h). These results indicate that Wnt-f-catenin signalling 
may contribute to the induction of REST in the ageing brain. 


Young 


Aged 


REST (AFU) 
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ST in neurodegene 
ization to punctate cytoplasmic structures that were labelled with the 
autophagosome markers LC3, ATG7 and ATG12, but not the lysosomal 
markers LAMP1 and LAMP2 (Extended Data Fig. 8a). To determine if 
autophagy could alter the nuclear:cytoplasmic distribution of REST, 
autophagy was activated in SH-SY5Y cells by serum deprivation. This 
resulted in significantly reduced nuclear REST, which was reversed by 
treatment with 3-methyladenine or bafilomycin, both inhibitors of auto- 
phagy (Extended Data Fig. 8b, c). Because activation of autophagy is char- 
acteristic of many age-related neurodegenerative disorders, we asked 
whether REST is also depleted in other dementing diseases. Examina- 
tion of cortical sections from patients with frontotemporal dementia 
(FTD) or dementia with Lewy bodies (DLB) showed marked depletion 
of nuclear REST (Fig. 5a). 

Induction of autophagy in neurodegenerative disorders has been 
associated with pathogenic misfolded proteins such as AB (ref. 19). In 
AD, many cortical and hippocampal neurons showed a notable colo- 
calization of REST with AB in LC3-positive autophagosomes (Fig. 5b). 
In contrast, REST did not colocalize with AB in amyloid plaques. Phos- 
phorylated tau and TDP-43 have been implicated as abnormally folded 
pathogenic proteins in FTD, whereas o-synuclein has been implicated 
in DLB. Every case (8) of tau-positive FTD examined showed colocali- 
zation of REST with phosphorylated tau ( PHF1 epitope) in LC3-positive 
autophagosomes (Fig. 5b). Similarly, every case (6) of TDP-43-positive 
FTD showed colocalization of REST with TDP-43 and LC3 (Fig. 5b). 
All ten cases of DLB examined showed partial colocalization of REST 
with «-synuclein in autophagosomes (Fig. 5b). Thus, autophagy of REST, 


Figure 5 | REST, autophagy and proteostasis in 
AD, DLB and FTD. a, Loss of nuclear REST in 


2009 x oe DLB and FTD. Left, immunofluorescence 
microscopy with labelling for REST (green), 
400 the neuronal marker MAP2 (red) and DNA 


(DAPI, blue). Right, quantitative analysis of 
immunofluorescence. Values (AFU) represent the 
mean + s.e.m.; Young, n = 11; Aged, n= 21; 
DLB, n = 18; FTD, n = 16. **P< 0.001 by 
Student’s unpaired t-test. b, Colocalization of REST 
with disease-associated misfolded proteins in 
autophagosomes. Cortical sections were 
triple-labelled for REST, LC3, and either AB in AD, 
a-synuclein (a-syn) in DLB, or phosphorylated 
tau (PHF1 epitope) or TDP-43 in FTD. Two 
different representative cases are shown for each 
disease. Scale bars, 20 um. 


©2014 Macmillan Publishers Limited. All rights reserved 


a 
al 
ep} 
Ww 
oc 
Episodic memory (Z-score) Semantic memory (Z-score) 
300 R=0.51 3005 
cE P<104  sagdey * 
D 200 < . 2 7pt. ae P 200 
o 100 . a 1004 
0+ 1 0) 
-4 2 -4 
Working memory (Z-score) Perceptual speed (Z-score) 
b 6 
co} . R=-0.41 
= Re] P<0.05 
n = ‘ e e 
Ww lod e 
a 2 e . 
oe 
2 o+ 28, 
Global cognitive (Z-score) 0 ? 2 2 4 a 
Braak score 
c d 
NCI/MCI 
= BAD dementia 
xe) 
o 3 
i 
E 2 — i. 
ip) 
Ww 1 
0 
CERAD1-2 REAGAN1-2 60 70 80 90 100 110 


Age (yr) 


Figure 6 | Nuclear REST is positively correlated with cognitive function and 
longevity. a, Linear regression analysis of nuclear REST levels in PFC neurons 
and cognitive test scores. Nuclear REST was imaged by anti-REST 
immunofluorescence and quantified by MetaMorph. Each point represents an 
individual case. n = 111 cases (59 females, 52 males), age range 71-90 years. 
b, Nuclear REST levels are inversely correlated with Braak stage (extent of 
neurofibrillary tangle formation). c, Nuclear REST levels in PFC from cases 
with neuropathological AD (moderate/frequent plaques by CERAD score 
and intermediate/high likelihood AD by NIA-Reagan criteria) that had mild 
or no cognitive impairment (NCI/MCI), or AD dementia. Values are the 
mean + s.e.m. *P < 0.01 by Student’s unpaired t-test; NCI/MCI n = 30, AD 
dementia n = 21. d, Nuclear REST levels in ageing neurons correlate positively 
with longevity. REST levels were quantified by FACS analysis of isolated PFC 
neuronal nuclei in 61 individuals without AD (age range 67-104 years). 


together with pathogenic misfolded proteins, may reduce nuclear trans- 
location in AD and other neurodegenerative disorders. 


REST, cognitive preservation and longevity 


A central question is whether loss of REST in AD contributes to cog- 
nitive decline. We explored this question by using measures of cogni- 
tive function derived from neuropsychiatric assessments performed 
longitudinally in the Religious Orders Study and the Rush Memory 
and Ageing Project (Methods). Linear regression analysis showed that 
nuclear REST levels in prefrontal cortical neurons are positively cor- 
related with a measure of global cognition, and separate measures of 
episodic, semantic and working memory with high levels of statistical 
significance (Fig. 6a and Extended Data Fig. 9a). Similar results were 
obtained in pyramidal neurons in the hippocampal CA1 region. For 
both brain regions, REST levels were most robustly correlated with 
episodic memory, which typically declines early in the course of AD. Ele- 
vated REST levels also predicted reduced AD neuropathology (Fig. 6b 
and Extended Data Fig. 9b). 

We then asked if elevated REST levels might confer resistance to AD 
pathology. Cases with National Institutes on Aging and Reagan Institute 
(NIA-Reagan) and Consortium to Establish a Registry for Alzheimer’s 
disease (CERAD) scores indicative of pathological AD, but with mild 
or no cognitive impairment, were compared to cases with pathological 
AD together with severe cognitive impairment. Nuclear REST levels 
were significantly elevated in cognitively preserved cases relative to cog- 
nitively impaired cases, despite the fact that both groups met patho- 
logical criteria for AD (Fig. 6c). Thus, elevated REST levels are associated 
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with cognitive preservation in individuals who meet pathological cri- 
teria for AD. 

Because REST is associated with stress resistance and preservation 
of cognitive function, we explored the relationship to human lifespan. 
Nuclear REST levels in PFC neurons showed a significant positive cor- 
relation with longevity (R = 0.53; P< 10 *) (Fig. 6d). This relationship 
remained significant after multivariate adjustment for cognitive test scores 
and indices of neuropathology. Furthermore, the expression of REST 
target genes showed a highly significant inverse correlation with lon- 
gevity (Extended Data Fig. 2a). Examination of the neuroanatomical 
distribution of REST in individuals with extreme longevity showed 
marked induction in neurons of the PFC and hippocampus, but not in 
the Purkinje or granule cell neurons of the cerebellum (Extended Data 
Fig. 10a). Thus, REST expression in specific neuronal populations cor- 
relates with longevity. 

We then asked if REST regulates genes that have been implicated in 
stress resistance and the ageing process. In addition to repressing cell 
death genes, REST also promoted expression of the anti-apoptotic gene 
BCL2, the antioxidants catalase and SOD1, and the FOXO transcrip- 
tion factor FOXO1 (Extended Data Figs 3a and 10b). As FOXO1 has 
been implicated in oxidative stress resistance’ and human longevity”’, 
we assessed its regulation by REST at the protein level. REST knock- 
down by shRNA transduction in SH-SY5Y cells almost completely abol- 
ished FOXO1 expression, which was reversed by an shRNA-resistant 
mouse REST vector (Extended Data Fig. 10c). Thus, REST may broadly 
confer stress resistance during ageing. 


Discussion 


We have demonstrated a striking induction of the developmental tran- 
scriptional repressor REST in specific neuronal populations of the ageing 
human brain together with epigenetic repression of REST target genes. 
REST has been investigated extensively as a repressor of neuronal genes 
during embryonic development, a function that persists in adult non- 
neuronal cells”. Our findings suggest that REST additionally plays a 
role as a neuroprotective modulator, in part by repressing genes that 
promote cell death and the pathology of AD. Moreover, REST increases 
the expression of FOXO transcription factors that mediate oxidative 
stress resistance”, as well as the antioxidant enzymes catalase and SOD1, 
possibly through indirect mechanisms such as repression of micro RNAs. 
Consistent with these findings, REST confers oxidative stress resistance 
and protects against toxic insults associated with AD, including AB 
oligomers and tau phosphorylation. Furthermore, REST appears to be 
essential for maintaining neuronal viability in the normal ageing cortex 
and hippocampus. This protective function is conserved in C.elegans. 
Previous ChIP-SACO” and ChIP-chip” studies also identified REST 
target genes involved in cell death. In addition, a recent study suggests 
that REST protects against the toxicity of 1-methyl-4-phenyl-1,2,3,6- 
tetrahydropyridine (MPTP), a model of Parkinson’s disease”®. Thus, 
REST may coordinate a stress response that is broadly neuroprotective 
in the ageing brain. 

REST dysfunction may contribute to the pathogenesis of a number 
of different neurodegenerative disorders. In addition to AD, REST was 
also significantly depleted in frontotemporal dementia and dementia 
with Lewy bodies. In each of these disorders, REST was lost from the 
nucleus and appeared in autophagosomes together with pathological 
misfolded proteins, including AB, phosphorylated tau, TDP-43 and a- 
synuclein. This may represent a common pathogenic mechanism that 
links altered proteostasis to aberrant gene expression. 

Recent studies suggest that epigenetic regulation of chromatin may 
modulate the cognitive outcome ofa variety of neuropathological states”. 
It is intriguing that ageing individuals who harbour substantial AD 
pathology do not appear to progress to dementia when neuronal REST 
levels are high. This raises the possibility that structural pathology, such as 
AB deposition and neurofibrillary tangles, may not be sufficient to cause 
dementia. Rather, failure of the brain’s stress response system may also 
be necessary, suggesting new possibilities for therapeutic intervention. 
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METHODS SUMMARY 


Post-mortem human brain material was procured in accordance with institutional 
guidelines governed by approved protocols. Tissue samples were procured from 
the Rush University Medical Center, the University of Maryland, Duke University, 
the Brigham and Women’s Hospital, and the Massachusetts General Hospital. Mouse 
protocols were approved by the Institutional Animal Care and Use Committee of 
Harvard Medical School. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Enhancers control the correct temporal and cell-type-specific activation of gene expression in multicellular eukaryotes. 
Knowing their properties, regulatory activity and targets is crucial to understand the regulation of differentiation and 
homeostasis. Here we use the FANTOMS panel of samples, covering the majority of human tissues and cell types, to produce 
an atlas of active, in vivo-transcribed enhancers. We show that enhancers share properties with CpG-poor messenger RNA 
promoters but produce bidirectional, exosome-sensitive, relatively short unspliced RNAs, the generation of which is strongly 
related to enhancer activity. The atlas is used to compare regulatory programs between different cells at unprecedented 
depth, to identify disease-associated regulatory single nucleotide polymorphisms, and to classify cell-type-specific and 
ubiquitous enhancers. We further explore the utility of enhancer redundancy, which explains gene expression strength 
rather than expression patterns. The online FANTOMS5 enhancer atlas represents a unique resource for studies on cell- 


type-specific enhancers and gene regulation. 


Precise regulation of gene expression in time and space is required for 
development, differentiation and homeostasis’. Sequence elements within 
or near core promoter regions contribute to regulation’, but promoter- 
distal regulatory regions like enhancers are essential in the control of 
cell-type specificity’. Enhancers were originally defined as remote ele- 
ments that increase transcription independently of their orientation, 
position and distance to a promoter’. They were only recently found to 
initiate RNA polymerase II (RNAPII) transcription, producing so-called 
eRNAs*. Genomic locations of enhancers can be detected by mapping 
of chromatin marks and transcription factor binding sites from chro- 
matin immunoprecipitation (ChIP) assays and DNase I hypersensitive 
sites (DHSs) (reviewed in ref. 1), but there has been no systematic 
analysis of enhancer usage in the large variety of cell types and tissues 
present in the human body. 

Using cap analysis of gene expression® (CAGE), we show that enhan- 
cer activity can be detected through the presence of balanced bidirec- 
tional capped transcripts, enabling the identification of enhancers from 
small primary cell populations. Based upon the FANTOMS5 CAGE express- 
ion atlas encompassing 432 primary cell, 135 tissue and 241 cell line 


samples from human’, we identify 43,011 enhancer candidates and char- 
acterize their activity across the majority of human cell types and tissues. 
The resulting catalogue of transcribed enhancers enables classification 
of ubiquitous and cell-type-specific enhancers, modelling of physical 
interactions between multiple enhancers and TSSs, and identification of 
potential disease-associated regulatory single nucleotide polymorph- 
isms (SNPs). 


Bidirectional capped RNAs identify active enhancers 


The FANTOMS project has generated a CAGE-based transcription 
start site (TSS) atlas across a broad panel of primary cells, tissues and 
cell lines covering the vast majority of human cell types®. Within that 
data set, well-studied enhancers often have CAGE peaks delineating 
nucleosome-deficient regions (NDRs) (Supplementary Fig. 1). To 
determine whether this is a general enhancer feature, FANTOM5 CAGE 
(Supplementary Table 1) was superimposed on active (H3K27ac-marked) 
enhancers defined by HeLa-S3 ENCODE ChIP-seq data’. CAGE tags 
showed a bimodal distribution flanking the central P300 peak, with 
divergent transcription from the enhancer (Fig. 1a). Similar patterns 
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Figure 1 | Bidirectional capped RNAs is a signature feature of active 
enhancers. a, Enhancers identified by co-occurrence of H3K27ac and 
H3K4mel ChIP-seq data’, centred on P300 binding sites, in HeLa cells were 
overlaid with HeLa CAGE data (unique positions of CAGE tag 5’ ends, 
smoothed by a 5-bp window), revealing a bidirectional transcription pattern. 
Horizontal axis shows the + 500 bp region around enhancer midpoints. 

b, Density plot illustrating the difference in directionality of transcription 


were observed in other cell lines (Supplementary Fig. 2a). Enhancer- 
associated reverse and forward strand transcription initiation events 
were, on average, separated by 180 base pairs (bp) and corresponded to 
nucleosome boundaries (Supplementary Figs 3 and 4). Asa class, active 
HeLa-S3 enhancers had 231-fold more CAGE tags than polycomb- 
repressed enhancers, indicating that transcription is a marker for active 
usage. Indeed, ENCODE-predicted enhancers’ with significant reporter 
activity* had greater CAGE expression levels than those lacking reporter 
activity (P<4 x 10 77, Mann-Whitney U test). A lenient threshold 
on enhancer expression increased the validation rate of ENCODE 
enhancers from 27% to 57% (Supplementary Fig. 5). 

Although capped RNAs of protein-coding gene promoters were 
strongly biased towards the sense direction, similar levels of capped 
RNA in both directions were detected at enhancers (Fig. 1b and Sup- 
plementary Fig. 2b, c). Thus, bidirectional capped RNAs is a signature 
feature of active enhancers. On this basis, we identified 43,011 enhan- 
cer candidates across 808 human CAGE libraries (see Supplementary 
Text and Supplementary Figs 6-8). Interestingly, the candidates were 
depleted of CpG islands (CGI) and repeats (with the exception of 
neural stem cells, see ref. 9). 

To confirm the activity of newly identified candidate enhancers, we 
randomly selected 46 strong, 41 moderate and 36 low activity enhancers 


Prediction category 


according to FANTOM5-pooled CAGE tags mapped within + 300 bp of 
22,486 TSSs of RefSeq protein-coding genes and centre positions of 10,138 
HeLa enhancers defined as above. c, Success rates of 184 in vitro enhancer 
assays in HeLa cells. Vertical axis shows the fraction of active enhancers 
(success defined by Student’s t-test, P< 0.05 versus random regions; also see 
Supplementary Fig. 9). Numbers of successful assays are shown on the 
respective bar. See main text for details. 


(as defined by CAGE tag frequency in HeLa cells) and examined their 
activity using enhancer reporter assays compared to randomly selected 
untranscribed loci with regulatory potential in HeLa-S3 cells: 15 DHSs"°, 
26 ENCODE-predicted ‘strong enhancers” and 20 enhancers defined 
as in Fig. la (Supplementary Tables 2 and 3). Whereas 67.4-73.9% of 
the CAGE- defined enhancers showed significant reporter activity, only 
20-33.3% of the untranscribed candidate regulatory regions were active 
(Fig. 1c and Supplementary Fig. 9a). The same trend was observed in 
HepGz2 cells (Supplementary Fig. 10a, b). Corresponding promoter-less 
constructs showed that the enhancer transcription read-through is 
negligible (Supplementary Fig. 9b, c). Many CAGE-defined enhancers 
overlapped predicted ENCODE ‘strong enhancers’ or ‘TSS’ states (25% 
and 62%, respectively, for HeLa-S3), but there was no substantial differ- 
ence in validation rates between these classes (Supplementary Fig. 10c, d). 
In summary, active CAGE-defined enhancers were much more likely to 
be validated in functional assays than untranscribed candidate enhan- 
cers defined by histone modifications or DHSs. 


Initiation and fate of enhancer RNAs versus mRNAs 

RNA-seq data from matching primary cells and tissues showed that 
~95% of RNAs originating from enhancers were unspliced and typ- 
ically short (median 346 nucleotides)—a striking difference to mRNAs 


b c d S e .. 
124 2 = Ss ‘ 
2 Ss oe So @ ares ey 
cay 2) gy 8 ne = Q AGHARV Db — 3.0 
4 S ae re Bank —msokoatcs 
3°* 3 8 882 o8 S0S883 So 8CRSSex pom” FE 
Boa 3 8 Tose es asa Soe Hsh Sos Tsk SSM, $825 
. ia ;2s5- reo ploaeetts eo 3 
3 is ERGS BOM RES SSQgSI aka mo £5 2a 
2 QOS SOF <gn be We ees a 2.0 
0.0 3 PQ TIVQSSOMMGSSSSSLHOS HGS BS 20 oa 2 
04 8 QE Rs kad wee BSS ead ies 58 a 
160 tk 10k 10K SESE 6 8 SENET 5° “BAG Ob BeBe: 85 15 
Length (| t 6 S5 £5) ORO OHODGG-G0F) 96 ; 
ao FES gba peussoshess eee Si0f:. « 
= Ethancer NA secomic 8 SEeGeSureeeotarSsoshas | TO 
os SHERSroRe pee seanes coast 150" 6” 70 
~- GENCODE transcript, genomic , 0 500 1,000 Q IAS SESE GRR RS Bats SOT OHDS Relative distance to TSS 
Distance (bp) , ail i | rete or enhancer midpoint 
— nCGI TSS sense — CGI TSS antisense 24 by ny — Enhancer + strand 
as p Enhancer =— Enhancer - strand 
CGI TSS sense — Enhancer CGl-promoter — RefSeq TSS sense 
nCGITSS antisense -- Genomic expectation High HZ Low —{ map eae — RefSeq TSS antisense 


DNase | cleavage 
(per nucleotide) 


Figure 2 | Features distinguishing enhancer TSSs from mRNA TSSs. 

a, Densities of the genomic and processed RNA lengths of transcripts starting 
from enhancer TSSs and mRNA TSSs using assembled RNA-seq reads from 
13 pooled FANTOMS libraries. b, Frequencies of RNA processing motifs 

(5’ splice motif (5’SS, upper panel) and the transcription termination site 
hexamer (TTS, lower panel)) around enhancer and mRNA TSSs. Vertical axis 
shows the average number of predicted sites per kb within a certain window 
size from the TSS (horizontal axis) in which the motif search was done. Dashed 
lines indicate expected hit density from random genomic background. The 
window always starts at the gene or enhancer CAGE summits and expands 
in the sense direction. nCGI, non-CGI. c, Average nucleotide frequencies 
(top panel) and DNase I cleavage patterns (lower panel) of enhancer CAGE 
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peaks (arrow at +1 indicates position of the main enhancer CAGE peaks; 
direction of transcription goes left to right) reveal distinct cleavage patterns 
at sequences resembling the INR and TATA elements. d, De novo motif 
enrichment analyses around enhancers and non-enhancer FANTOM5 
CAGE-defined TSSs (CAGE TSSs matching annotated TSSs are referred to 
as ‘promoters’), contingent on CGI overlap. Top enriched/depleted motifs are 
shown along with their best-known motif match name. Enrichment versus 
random background is presented as a heat map. e, Vertical axis shows average 
HeLa CAGE expression fold change versus control at enhancers and RefSeq 
TSSs after exosome depletion. Horizontal axis shows position relative to the 
TSS or the centre of the enhancer. Translucent colours indicate the 95% 
confidence interval of the mean. 
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(19% unspliced, median 1,256 nucleotides) (Fig. 2a and Supplementary 
Fig. 1la—c). Unlike TSSs of mRNAs, which are enriched for predicted 
5’ splice sites but depleted of downstream polyadenylation signals’’”’, 
enhancers showed no evidence of associated downstream RNA pro- 
cessing motifs, and thus resemble antisense promoter upstream tran- 
scripts (PROMPTs)”" (Fig. 2b and Supplementary Fig. 11d). Most CAGE- 
defined enhancers gave rise to nuclear (>80%) and non-polyadenylated 
(~90%) RNAs" (Supplementary Fig. 1le). Based on RNA-seq, few 
enhancer RNAs overlap exons of known protein-coding genes or large 
intergenic noncoding RNAs (9 and 1 out of 4,208 enhancers detected, 
respectively), indicating that they are not a substantial source of alter- 
native promoters for known genes (as in ref. 14). 

TSS-associated, uncapped small RNAs (TSSa-RNAs), attributed to 
RNAPII protection and found immediately downstream of mRNA 
TSSs’*"®, were detectable in the same positions downstream of enhan- 
cer TSSs (Supplementary Fig. 12), indicating that RNAPII initiation at 
enhancer and mRNA TSSs is similar. Indeed, CAGE-defined enhancer 
TSSs resembled the proximal position-specific sequence patterns of 
non-CGI RefSeq TSSs (Fig. 2c and Supplementary Fig. 13a). Furthermore, 
de novo motif analysis revealed sequence signatures in CAGE-defined 
enhancers closely resembling non-CGI promoters (Fig. 2d and Sup- 
plementary Fig. 13b). 

Because of the similarity with PROMPTs, we reasoned that capped 
enhancer RNAs might be rapidly degraded by the exosome. Indeed, 
small interfering RNA-mediated depletion of the SKIV2L2 (also known 
as MTR4) co-factor of the exosome complex resulted in a median 3.14- 
fold increase of capped enhancer-RNA abundance (Fig. 2e and Sup- 
plementary Fig. 14a, b), but only a negligible increase at mRNA TSSs. 
This increasing trend is similar to that of PROMPT regions upstream 
of TSSs, although the increase of enhancer RNAs was significantly 
higher (P< 4.6 X 10 ©’, Mann-Whitney U test; Fig. 2e and Supplemen- 
tary Fig. 14b, c). Thus, the bidirectional transcriptional activity observed 
at enhancers is also present at promoters, as suggested previously”, 
but in promoters only the antisense RNA is degraded. Furthermore, 
the CAGE expression of enhancers in control and SKIV2L2 -depleted 
cells was proportional (Supplementary Fig. 14d), indicating that vir- 
tually all identified enhancers produce exosome-sensitive RNAs. The 
number of detectable bidirectional CAGE peaks increased 1.7-fold 
upon SKIV2L2 depletion and novel enhancer candidates had on average 
similar, but weaker, chromatin modification signals compared to control 
HeLa cells (Supplementary Fig. 14e). 


CAGE identifies cell-specific enhancer usage 


To test whether CAGE expression can identify cell-type-specific enhan- 
cer usage in vivo, ChIP-seq (H3K27ac and H3K4me1), DNA methyla- 
tion and triplicate CAGE analyses were performed in five primary 
blood cell types, and compared to published DHS data (http://www. 
roadmapepigenomics.org/, Supplementary Table 4). CAGE-defined 
enhancers were strongly supported by proximal H3K4me1/H3K27ac 
peaks (71%) and DHSs (87%) from the same cell type. Conversely, 
H3K4mel and H3K27ac supported only 24% of DHSs distal to pro- 
moters and exons and only 4% of DHSs overlapped CAGE-defined 
enhancers (Supplementary Fig. 15), indicating that a minority of pro- 
moter-distal DHSs identify enhancers. From the opposite perspective, 
only 11% of H3K4me1/H3K27ac loci overlapped CAGE-defined enhan- 
cers and untranscribed loci showed weaker ChIP-seq signals than 
transcribed ones (Supplementary Fig. 16). Moreover, there was a clear 
correlation between CAGE, DNase I hypersensitivity, H3K4mel and 
H3K27ac for CAGE-defined enhancers expressed in blood cells (Fig. 3a). 
Accordingly, cell-type-specific enhancer expression corresponds to cell- 
type-specific histone modifications (Fig. 3b). The majority of selected 
cell-type-specific enhancers could be validated in corresponding cell 
lines and were associated with cell-type-specific DNA demethylation 
(Supplementary Text, Supplementary Fig. 17 and Supplementary 
Tables 5-8, see also ref. 18). Thus, bidirectional CAGE pairs are robust 
predictors for cell-type-specific enhancer activity. 
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An atlas of transcribed enhancers across human cells 


The FANTOMS CAGE library collection® enables the dissection of 
enhancer usage across cell types and tissues comprehensively sampled 
across the human body. Clustering based on enhancer expression clearly 
grouped functionally related samples together (Fig. 3c and Supplemen- 
tary Figs 18 and 19). Although fetal and adult tissue often grouped 
together, two large fetal-specific clusters were identified: one brain- 
specific (pink) and one with diverse tissues (green). The fetal-brain 
cluster is associated with enhancers that are located close to known 
neural developmental genes, including NEUROG2, SCRT2, POU3F2 
and MEF2C (Supplementary Fig. 18b), for which gene expression pat- 
terns correlate with enhancer RNA abundance across libraries, suggest- 
ing regulatory interaction (see below). The results corroborate the func- 
tional relevance of these enhancers for tissue-specific gene expression 
and indicate that they are an important part of the regulatory programs 
of cellular differentiation and organogenesis. 

To confirm that candidate enhancers can drive tissue-specific gene 
expression in vivo, five evolutionarily conserved CAGE-defined human 
enhancers (including the POU3F2 and MEF2C-proximal enhancers 
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Figure 3 | CAGE expression identifies cell-type-specific enhancer usage. 

a, Relationship between CAGE and histone modifications in blood cells. Rows 
represent CAGE-defined enhancers that are ordered based on hierarchical 
clustering of CAGE expression. Columns for the CAGE tags (pink) represent 
the expression intensity for three biological replicates. DNase I hypersensitivity 
and H3K27ac and H3K4mel ChIP-seq signals + 1kb around the enhancer 
midpoints are shown in green, blue and orange, respectively. b, Mean signal of 
DNase-seq as well as ChIP-seq for H3K27ac and H3K4mel (vertical axes) per 
cell type (rows) in + 1kb regions (horizontal axes) around enhancer midpoints, 
for enhancers with blood-cell type-specific CAGE expression (columns). 

c, Dendrogram resulting from agglomerative hierarchical clustering of tissue 
samples based on their enhancer expression: each leaf of the tree represents one 
CAGE tissue sample (for a labelled tree and the corresponding results on 
primary cell samples, see Supplementary Figs 18 and 19). Sub-trees dominated 
by one tissue/organ type or morphology are highlighted. Some of the enhancers 
responsible for the fetal-specific subgroup in the larger brain sub-tree are 
validated in vivo (Fig. 4). 
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identified above) were tested via Tol2-mediated transgenesis in zebra- 
fish embryos. We observed tissue-specific enhancer activity with 3 of 5 
fragments, which corresponded to the human enhancer tissue express- 
ion (Fig. 4). None of three control fragments without CAGE signal 
activated the GATA2 promoter (Supplementary Table 9). Although 
the sample size is not high enough to reliably estimate the validation 
rates in zebrafish, the correlation between the enhancer usage profiles 
in zebrafish to those defined in human by CAGE is notable. 

We grouped the primary cell and tissue samples into larger, mutu- 
ally exclusive cell type and organ/tissue groups (referred to as facets), 
respectively, with similar function or morphology (Supplementary 
Tables 10 and 11). Figure 5 summarizes how many enhancers were 
detected in each facet and the degree of facet-specific CAGE express- 
ion (see also Supplementary Fig. 21). From the data we can draw 
several conclusions: 

First, the majority of detected enhancers within any facet are not 
restricted to that facet. Exceptions, where facets use a higher fraction 
of specific enhancers, include immune cells, neurons, neural stem 
cells and hepatocytes amongst the cell-type facets, and brain, blood, 
liver and testis amongst the organ/tissue facets. 

Second, despite their apparent promiscuity, enhancers are more 
generally detected in a much smaller subset of samples than mRNA 
transcripts (Supplementary Figs 21 and 22a, b), consistent with cell- 
line studies’ and the higher specificity of ncRNAs in general’’. Facets 
in which we detect many enhancers typically also have a higher frac- 
tion of facet-specific enhancers (Supplementary Fig. 22c, d). 

Third, the number of detected expressed enhancers and mRNA 
transcripts is correlated (Supplementary Fig. 21b), but the number 
of detected expressed gene transcripts (>1 tag per million mapped reads 
(TPM)) is 19-34 fold larger than the number of detected enhancers 
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Figure 4 | In vivo validation in zebrafish of tissue-specific enhancers. 
Validations of in vivo activity of CAGE-defined human enhancers CRE1, CRE2 
and CRE3 in zebrafish embryos at long-pec stage. Each panel shows, from left to 
right, representative yellow fluorescent protein (YFP) and bright field images of 
embryos injected with the human enhancer gata2 promoter reporter gene 
construct (left), YFP zoom-ins (middle) and CAGE expression in TPM in 
human tissues/cell types for the enhancer (right). Muscle (mu) and yolk 
syncytial layer (ysl) activities are background expression coming from the gata2 
promoter-containing reporter construct. All images are lateral, head to the left. 
Note the correspondence between zebrafish and human enhancer usage/ 
expression. Supplementary Fig. 20 shows UCSC browser images of each 
selected enhancer. a, CRE1, ~230kb upstream of the MEFC2 gene, drives 
highly robust expression in the brain (brain) and neural tube (nt). Right panel 
gives zoom-in overlay image showing expression in the forebrain (fb), midbrain 
(mid), hindbrain (hin) and spinal cord (sp). b, CRE2, 5 kb upstream of the 
POU3E2 gene, is active in the floor plate (fp). c, CRE3, 10 kb upstream of the 
SOX7 gene TSS, shows specific expression in the vasculature (including 
intersegmental vessels (iv), dorsal vein (dv) and dorsal aorta (da). 
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with the cut-offs used. Noteworthy exceptions include blood and immune 
cells, testis, thymus and spleen, which have high enhancer/gene ratios. 
Conversely, smooth and skeletal muscle and skin, bone and epithelia- 
related cells have low ratios. Differential exosome activity between cell 
types might affect these results, but there was no correlation between 
SKIV2L2 mRNA expression and the number of enhancers detected 
(Supplementary Fig. 22e, f). 

As expected, consensus motifs of known key regulators are over- 
represented in corresponding facet-specific enhancers, for instance 
ETS, C/EBP and NF-«B in monocyte-specific enhancers, RFX and 
SOX in neurons, and HNF1 and HNF4a in hepatocytes (Supplemen- 
tary Fig. 23). Notably, the AP1 motif appears to be enriched across all 
facets, perhaps associated with a general role for AP1 in regulating 
open chromatin”. 


Expression clustering reveals ubiquitous enhancers 


Hierarchical clustering of enhancers by facet expression revealed a 
small subset of enhancers (200 or 247, defined by primary cell or tissue 
facets, respectively) expressed in the large majority of facets (Sup- 
plementary Text, Supplementary Figs 24 and 25, and Supplemen- 
tary Tables 12 and 13). Compared to other enhancers, these ubiquitous 
(u-) enhancers are 8 times more likely to overlap CGIs and they are 
twice as conserved (Supplementary Fig. 26a—c). U-enhancers overlap 
typical chromatin enhancer marks but have higher H3K4me3 signal 
(Supplementary Fig. 26d). Although they produce significantly longer 
ncRNAs than other enhancers (median 530 nucleotides, P< 1.5 X 10 8, 
Mann-Whitney U test), the transcripts remain predominantly (~78%) 
unspliced and significantly shorter (P< 4.2 X 10° '8, Mann-Whitney 
U test) than mRNAs (Supplementary Figs 27 and 28), do not share 
exons with known genes, and are exosome-sensitive (Supplementary 
Fig. 14b). Therefore, it is unlikely that these are novel mRNA promo- 
ters. They are also highly enriched for P300 and cohesin ChIP-seq 
peaks” and RNAPII-mediated ChIA-PET signal’? compared to other 
enhancers (Supplementary Fig. 26d). These results indicate that 
u-enhancers comprise a small but distinct subset of enhancers, which 
probably has specific regulatory functions used by virtually every 
human cell. 


Linking enhancer usage with TSS expression 


A major challenge is to link enhancers to their target genes 
Uniquely, FANTOMS5 CAGE allows for direct comparison between 
transcriptional activity of the enhancer and of putative target gene 
TSSs across a diverse set of human cells. Based on pairwise expression 
correlation, nearly half (40%) of the inferred TSS-associated enhan- 
cers (Methods) were linked with the nearest TSS, and 64% of enhan- 
cers have at least one correlated TSS within 500 kilobases. Several 
associations (10,260; 15.3%) are supported by ChIA-PET (RNAPII- 
mediated) interaction data”, and the supported fraction increases 
with the correlation threshold (Supplementary Fig. 29a). The fraction 
of supported associations is 4.8-fold higher than that of associations 
predicted from DNase I hypersensitivity correlations’® (20.6% versus 
4.3%, at the same correlation threshold), indicating that transcription 
is a better predictor of regulatory targets than chromatin accessibility. 
Conserved sequence motifs and ChIP-seq peaks also co-occurred 
significantly in associated enhancer-promoter pairs (Benjamini- 
Hochberg false discovery rate (FDR) < 0.05, binomial test), suggest- 
ing an additive or synergistic cooperation between enhancers and 
promoters at RNAPII foci. 

On average, a RefSeq TSS was associated with 4.9 enhancers and an 
enhancer with 2.4 TSSs and we observed different regulatory archi- 
tectures around genes (Supplementary Fig. 30). For example, at the 
beta-globin locus the CAGE expression patterns of four locus control 
region hypersensitive sites are highly correlated (Pearson’s r between 
0.88 and 0.98) with the expression of known target genes**** HBG2 
and HBD, and to some extent HBG1. 
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; Figure 5 | Enhancer usage and 
specificity in groups of cells. The 
upper panel gives the number of 
detected enhancers per million 
CAGE tags within each group (facet) 
of related cell type libraries. The 
expression specificity of the 
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enhancers is shown as a heat map in 
the panel below. Colours show the 
fraction of expressed enhancers 

in each facet (columns) that are in 
each specificity range (rows). For 
corresponding plots on organ/tissue 


facets and genes, see Supplementary 


These observations call for computational models of enhancer regu- 
lation, in which multiple enhancers may work in concert to enhance 
the expression of a gene. To this end, we focused on 2,206 RefSeq TSSs 
for which the joint expression of nearby enhancers (the closest ten 
enhancers within 500 kb) is highly predictive of the gene expression. 
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Model shrinkage showed that in most cases, only one to three enhancers 
are necessary to explain the expression variance observed in the linked 
gene, and generally proximal enhancers are more predictive than distal 
ones (Fig. 6a, Supplementary Fig. 29b-d and Supplementary Text). One 
hypothesis explaining the function of multiple enhancers driving the 
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Figure 6 | Linking enhancers to TSSs and disease-associated SNPs. a, The 
proportional contribution (see Methods) of the 10 most proximal enhancers 
within 500 kb of a TSS in a model explaining gene expression variance (vertical 
axis) as a function of enhancer expression. x axis indicates the position of the 
enhancer relative to the TSS: 1 the closest, etc. Bars indicate interquartile ranges 
and dots medians. b, Relationship between the number of highly correlated 
(‘redundant’) enhancers per locus (horizontal axis) and the maximal expression 
(TPM) of the associated TSS in the same model over all CAGE libraries (vertical 
axis). Error bars as in a. c, GWAS SNP sets preferentially overrepresented 
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within enhancers, exons and mRNA promoters. Observed and expected 
overlaps are shown above bars. The vertical axis gives enrichment odds ratios. 
The horizontal axis shows GWAS traits or diseases. d, Diseases with 
GWAS-associated SNPs over-represented in enhancers of certain expression 
facets. The horizontal axis gives the odds ratio as in panel c, broken up by 
expression facets: each point represents the odds ratio of GWAS SNP 
enrichment for a disease (vertical axis) in a specific expression facet. Summary 
annotations of point clouds are shown. See also Supplementary Fig. 31. 
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same expression pattern is that they might confer higher transcriptional 
output of a gene*”®. Indeed, the number of highly correlated (redund- 
ant) enhancers close to TSSs (Supplementary Methods) increased with 
the observed maximal TSS expression over all libraries (Fig. 6b), imply- 
ing that these enhancers are redundant in terms of transcription patterns 
but additive in terms of expression strength. Expression redundancy is 
also common in genomic clusters of closely spaced enhancers (24% of 
815 identified genomic clusters, Supplementary Table 15). These are 
associated with TSSs of genes involved in immune and defence responses 
and, as suggested by a previous study”, have a higher expression than 
other enhancer-associated genes (eightfold increase on average). 


Disease-associated SNPs are enriched in enhancers 


Many disease-associated SNPs are located outside of protein-coding 
exons and a large proportion of human genes display expression 
polymorphism”*. Using the NHGRI genome-wide association studies 
(GWAS) catalogue” and extending the compilation of lead SNPs with 
proxy SNPs in strong linkage disequilibrium (similar to refs 30, 31), 
we identified diseases/traits whose associated SNPs overlapped enhan- 
cers, promoters, exons and random regions significantly more than 
expected by chance (Fisher’s exact test P< 0.01, Supplementary Table 16). 
Disease-associated SNPs were over-represented in regulatory regions 
to a greater extent than in exons (Fig. 6c). For many traits where enriched 
disease-associated SNPs were within enhancers, enhancer activity was 
detected in pathologically relevant cell types (Fig. 6d and Supplementary 
Figs 31 and 32). Examples include Graves’ disease-associated SNPs 
enriched in enhancers that are expressed predominantly in thyroid 
tissue, and similarly lymphocytes for chronic lymphocytic leukaemia. 
Asa proof of concept, we validated the impact of two disease-associated 
regulatory SNPs within enhancers (Supplementary Fig. 33). 


Conclusions 


The data presented here demonstrate that bidirectional capped RNAs, 
as measured by CAGE, are robust predictors of enhancer activity in a 
cell. Transcription is only measured at a fraction of chromatin-defined 
enhancers and few untranscribed enhancers show potential enhancer 
activity. This implies that many chromatin-defined enhancers are not 
regulatory active in that particular cellular state, but may be active in 
other cells of the same lineage” or are pre-marked for fast regulatory 
activity upon stimulation’. Of course, given the relative instability of 
enhancer RNAs some chromatin-defined sites may be active but fall 
below the limits of detection of CAGE. 

Our results show that position-specific sequence signals upstream 
of the transcription initiation sites and the production of small, 
uncapped RNAs immediately downstream is present at both enhan- 
cers and mRNA promoters, suggesting similar mechanisms of ini- 
tiation. Previous studies (for example refs 10, 34, 35) suggested that 
promoters and enhancers differ in motif composition. This view is not 
supported by the larger FANTOMS data set. Instead, the differences 
reflect the local G+C content because transcribed enhancers tend to 
harbour low G+C content motifs like non-CGI promoters. Features 
distinguishing enhancers from mRNA promoters are (1) enhancer RNAs 
are exosome-sensitive regardless of direction whereas (sense) mRNAs 
have a longer half-life than their antisense counterpart; (2) enhancer 
RNAs are short, unspliced, nuclear and non-polyadenylated and (3) 
enhancers have downstream polyadenylation and 5’ splice motif fre- 
quencies at genomic background level similar to antisense PROMPTs, 
whereas mRNAs are depleted of termination signals and enriched for 
5’ splice sites'’”. 

The collection of active enhancers presented here provides a resource 
that complements the activity of the ENCODE consortium’ across a 
much greater diversity of tissues and cellular states. It has clear appli- 
cations in human genetics, to narrow the search windows for func- 
tional association, and for the definition of regulatory networks that 
underpin the processes of cellular differentiation and organogenesis in 
human development. 
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METHODS SUMMARY 


Single-molecule HeliScopeCAGE data was generated as described elsewhere’. 
Sequencing and processing of ribosomal RNA-depleted RNAs, short RNAs and 
H3K27ac or H3K4mel ChIPs as well as the processing of publicly available 
DNase-seq data are described in the Methods. 

Putative enhancers were identified from bidirectionally transcribed loci having 
divergent CAGE tag clusters separated by at most 400 bp (described in Sup- 
plementary Fig. 6a). We required loci to be divergently transcribed in at least 
one FANTOMS sample, defined by CAGE tag 5’ ends within 200 bp divergent 
strand-specific windows immediately flanking the loci midpoints. The expression 
of each enhancer in each FANTOMS sample was quantified as the normalized 
sum of strand-specific sums of CAGE tags in these windows. A sample-set wide 
directionality score, D, for each locus over aggregated normalized reverse, R, and 
forward, F, strand window-expression values across all samples, D = (F — R)/ 
(F + R), were then used to filter putative enhancers to have low, non-promoter- 
like, directionality scores (|D| < 0.8). Further filtering ensured enhancers to be 
located distant to TSSs and exons of protein- and noncoding genes. 

Motif enrichment analyses were done using HOMER”. Regulatory targets of 
enhancers were predicted by correlation tests using the sample-set wide express- 
ion profiles of all enhancer-promoter pairs within 500 kb. The regulatory effects 
of multiple enhancers were modelled using linear regression followed by lasso- 
based model-shrinkage”’. 

Enhancer activity was tested in vivo in zebrafish embryos using Tol2-mediated 
transgenesis**. Expression patterns were documented at 48 h post fertilization 
using >200 eggs per construct. Large-scale in vitro validations on randomly 
selected enhancers were performed using firefly/Renilla luciferase reporter plas- 
mids with enhancer sequences cloned upstream of an EFla basal promoter 
separated by a synthetic polyA signal/transcriptional pause site in a modified 
pGL4.10 (Promega) vector (Supplementary Fig. 9d). Full details are provided 
in the Methods. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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A promoter-level mammalian 


expression atlas 


The FANTOM Consortium and the RIKEN PMI and CLST (DGT)* 


Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell 
types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and 
their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian 
gene expression across the human body. We find that few genes are truly ‘housekeeping’, whereas many mammalian 
promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expres- 
sion profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes 
are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links 
them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample 
ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOMS) project provides com- 
prehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide 


applications in biomedical research. 


The mammalian genome encodes the instructions to specify develop- 
ment from the zygote through gastrulation, implantation and genera- 
tion of the full set of organs necessary to become an adult, to respond 
to environmental influences, and eventually to reproduce. Although 
the genome information is the same in almost all cells of an individual, 
at least 400 distinct cell types’ have their own regulatory repertoire of 
active and inactive genes. Each cell type responds acutely to alterations 
in its environment with changes in gene expression, and interacts with 
other cells to generate complex activities such as movement, vision, mem- 
ory and immune response. 

Identities of cell types are determined by transcriptional cascades that 
start initially in the fertilised egg. In each cell lineage, specific sets of 
transcription factors are induced or repressed. These factors together 
provide proximal and distal regulatory inputs that are integrated at tran- 
scription start sites (TSSs) to control the transcription of target genes. 
Most genes have more than one TSS, and the regulatory inputs that 
determine TSS choice and activity are diverse and complex (reviewed 
in ref. 2). 

Unbiased annotation of the regulation, expression and function of 
mammalian genes requires systematic sampling of the distinct mam- 
malian cell types and methods that can identify the set of TSSs and tran- 
scription factors that regulate their utilization. To this end, the FANTOM5 
project has performed cap analysis of gene expression (CAGE)? across 
975 human and 399 mouse samples, including primary cells, tissues 
and cancer cell lines, using single-molecule sequencing? (Fig. 1; see the 
full sample list in Supplementary Table 1). 

CAGE libraries were sequenced to a median depth of 4 million mapped 
tags per sample (Supplementary Methods) to produce a unique gene 
expression profile, focused specifically on promoter utilization. CAGE 
has advantages over RNA-seq or microarrays for this purpose, because 
it permits separate analysis of multiple promoters linked to the same 
gene’*. Moreover, we show in an accompanying manuscript* that the 
data can be used to locate active enhancers, and to provide numerous 
insights into cell-type-specific transcriptional regulatory networks 
(see the FANTOMS website http://fantom.gsc.riken.jp/5). The data 
extend and complement the recently published ENCODE? data, and 


microarray-based gene expression atlases® to provide a major resource 
for functional genome annotation and for understanding the transcrip- 
tional networks underpinning mammalian cellular differentiation. 


The FANTOMS promoter atlas 


Single molecule CAGE profiles were generated across a collection of 
573 human primary cell samples (~ 3 donors for most cell types) and 
128 mouse primary cell samples, covering most mammalian cell steady 
states. This data set is complemented with profiles of 250 different cancer 
cell lines (all available through public repositories and representing 154 
distinct cancer subtypes), 152 human post-mortem tissues and 271 mouse 
developmental tissue samples (Fig. 1a; see the full sample list in Sup- 
plementary Table 1). To facilitate data mining all samples were anno- 
tated using structured ontologies (Cell Ontology’, Uberon®, Disease 
Ontology’). The results of all analyses are summarized in the FANTOM5 
online resource (http://fantom.gsc.riken.jp/5). We also developed two 
specialized tools for exploration of the data. ZENBU, based on the 
genome browser concept, allows users to interactively explore the rela- 
tionship between genomic distribution of CAGE tags and expression 
profiles’®. SSTAR, an interconnected semantic tool, allows users to explore 
the relationships between genes, promoters, samples, transcription 
factors, transcription factor binding sites and coexpressed sets of pro- 
moters. These and other ways to access the data are described in more 
detail in Supplementary Note 1. 


CAGE peak identification and thresholding 


To identify CAGE peaks across the genome we developed decomposition- 
based peak identification (DPI; described in Supplementary Methods; 
Extended Data Fig. 1). This method first clusters CAGE tags based on 
proximity. For clusters wider than 49 base pairs (bp) it attempts to 
decompose the signal into non-overlapping sub-regions with different 
expression profiles using independent component analysis’. Sample- 
and genome-wide, DPI identified 3,492,729 peaks in human and 2,088,255 
peaks in mouse. To minimize the fraction of peaks’ that map to internal 
exons (which could exist due to post-transcriptional cleavage and recap- 
ping of RNAs”), and enrich for TSSs, we applied tag evidence thresholds 


*Lists of participants and their affiliations appear at the end of the paper. 
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Figure 1 | Promoter discovery and definition in FANTOMS. a, Samples 
profiled in FANTOMS. b, Reproducible cell-type-specific CAGE patterns 
observed for the 266 base CpG island associated B4GALT1 locus transcription 
initiation region hg19:chr9:33167138..33167403. CAGE profiles for CD4* T 
cells (blue), CD14 monocytes (gold), aortic smooth muscle cells (green) and 
the adrenal cortex adenocarcinoma cell line SW-13 (red) are shown. A 
combined pooled profile showing TSS distribution across the entire human 
collection is shown in black. Values on the y axis correspond to maximum 
normalized TPM for a single base in each track. c, Decomposition-based peak 
identification (DPI) finds 6 differentially used peaks within this composite 
transcription initiation region (note: peaks are labelled from p1@B4GALT1 


to define robust and permissive subsets (described in more detail in 
Supplementary Methods and summarized in Table 1). Specifically the 
robust threshold, which is used for most of the analyses presented here, 
enriched for peaks at known 5’ ends compared to known internal 
exons by twofold (that is, two-thirds of the peaks hitting known full- 
length transcript models hit the 5’ end). A flow diagram showing the 
relationship between samples, peaks, thresholding and subsets used in 
each analysis is provided in the Supplementary Figure 1. Supporting 
evidence that the peaks are genuine TSSs, based upon support from 
expressed sequence tags (ESTs), histone H3 lysine 4 trimethylation 
(H3K4Me3) marks and DNase hypersensitive sites is provided in Sup- 
plementary Note 2. 

Figure 1b illustrates the 266 bp spanning transcription initiation 
region of B4GALT1, where 6 independent robust peaks were identified 
by DPI, each with a unique regulatory pattern (Fig. 1c). A total of 58% 
of human and 56% of mouse robust peaks occur in such composite 
transcription initiation regions, defined as clusters of robust peaks within 
100 bases of each other. More than half of these contain peaks with 
statistically significant differences in expression profiles (63% of human 
and 54% of mouse composite transcription initiation regions; likelihood 
ratio test, false discovery rate (FDR) <1%, Extended Data Fig. 1d). Sup- 
plementary Tables 2 and 3 summarize public domain EST evidence 
that these independent peaks contained within composite transcrip- 
tion initiation regions give rise to long RNAs. 


p6@B4GALT1 


p3@B4GALT1 p1@B4GALT1 


with most tag support through to p7@B4GALT1 with the least tag support; 
p4@B4GALT1 is not shown and is in the 3’ UTR of the locus at position 
hg19::chr9:33111241..33111254—). Note in particular one large broad region 
on the left used in all samples and a sharp peak to the right, preferentially used 
in the aortic smooth muscle cells. d, Venn diagram showing DPI defined peaks 
expressed at = 10 TPM in primary cells (red), tissues (blue) and cell lines 
(green). e, Fraction of unannotated peaks observed in subsets of d. P, primary 
cells, T, tissues, C, cell lines, PT, TC, PC and PTC correspond to peaks found 
in multiple sample types, for example, PT, found in primary cells and 

tissue samples. 


Known gene coverage in FANTOM5 


To provide annotation of the CAGE peaks, the distance between indi- 
vidual peaks and the 5’ ends of known full-length transcripts was deter- 
mined and then peaks within 500 bases of the 5’ end of known transcript 
models were assigned to that gene (see Supplementary Methods, Table 1). 
To provide names for each TSS region, peaks identified at the permis- 
sive threshold were ranked by the total number of tags supporting each 
and then sequentially numbered (for example, p1@GFAP corresponds 
to the promoter of GFAP which has the highest tag support). From 
these annotations, TSS for 91% of human protein coding genes (as 
defined by the HUGO Gene Nomenclature Committee) were supported 
by robust CAGE peaks, and 94% at the permissive threshold (Sup- 
plementary Note 3). The atlas also detected signals from the promoters 
of short RNA primary transcripts, and long non-coding RNAs. In 
comparison to the previous FANTOM3 and 4 projects, FANTOMS5 
measured expression at an additional 4,721 human and 5,127 mouse 
RefSeq genes. The inclusion of primary cells, cell lines and tissues in 
the atlas provided greater coverage than any of the sample types alone 
(Fig. 1d) and the primary cell samples in particular were a rich source 
of unannotated peaks (Fig. le). 


Mammalian promoter architectures 


Mammalian promoters can be classified as broad or sharp types, based 
upon local spread of TSSs along the genome’’. The FANTOMS data 
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Table 1 | Summary of peaks, coverage and genes hit in FANTOM5 


Human Mouse 
Peaks Stranded genome Number of Genes Peaks Peaks Stranded genome Number of Genes Peaks 
coverage (bp) aligned reads hit per coverage (bp) aligned reads hit per 
gene gene 
The whole = 6.2x109 100% 45x102 100% — = = 5.3x102 100% 19x109 100% — — 
genome 
‘Permissive’ 1,048,124 14x10’ 0.22% 3.6x10° 80% 20808 — 652,860 84x10° 0.16% 15x10? 79% 20,480 _ 
CAGE peaks 
(A) Within 245,514 43%x10° 007% 3.0x10° 68% 20808 11.8 146185 25x10° 0.05% 13x10? 69% 20,80 7.1 
500 bp of 
annotated 5’ 
(B) TSS 217,572 40x10° 006% 29x10° 64% 18503 — 129,466 24x10® 0.05% 10x10? 52% 17,088 - 
classifier 
positive 
(Aor B) Likely 308,214 5.3x10° 009% 32x109 72% 20808 — 173,564 30x10 0.06% 14x10? 70% 20,480 — 
TSS 
‘Robust’ CAGE 184,827. 3.9x10° 0.06% 3.5x10° 77% 8961 — 116,277. 25x10 0.05% 14x10? 75% 19,001 = 
peaks 
(A) Within 82,150 22x10 0.04% 3.0x10° 66% 8961 43 61,134 16x10 003% 13x10? 68% 19,001 3.2 
500bp of 
annotated 5’ 
(B) TSS 76445 21%x10° 0.03% 29x10° 63% 7,285 — 51,611 1.4X10° 0.03% 9.9x10® 51% 16,028 = 
classifier 
positive 
(Aor B) Likely 92,783 24x10®© 004% 3.2 x10? 70% 8961 — 77674 1.7X10° 0.03% 13x10? 69% 19,001 = 
TSS 
Cross-species 70,351 1.6X10° 0.03% 105,157 24x10° 0.04% 
projected 
robust peaks 
'Homologous’ 34,041 1.0x10° 0.02% 42,423, 13%x10° 0.02% 


robust peaks 


confirmed this general observation (Extended Data Fig. 2), however, 
for the first time the greater depth of sequencing enabled identification 
of the preferred TSS within broad promoters. Taking each library in 
turn, using the location of the dominant TSS (that is, the TSS with the 
highest number of tags), we searched for phased WW dinucleotides 
(AA/AT/TA/TT) associated with nucleosome location (Extended Data 
Fig. 2). Remarkably, on a genome-wide scale, there was a periodic spacing 
of WW motifs with a 10.5 bp repeat downstream of the dominant TSS, 
exactly as shown previously for well-phased H2A.Z nucleosomes 
(Extended Data Fig. 2d). The precise phasing was supported further 
by the pattern of H2A.Z and H3K4me3 chromatin immunoprecipita- 
tion sequencing (ChIP-seq) signal seen around TSS in CD14* mono- 
cytes and frontal lobe respectively (Extended Data Fig. 2e, f). This 
observation indicates that the positioned nucleosome is a key indicator 
of start site preference in broad promoters. 


Expression levels and tissue specificity 


The raw tag counts under the DPI peak coordinates were used to gen- 
erate an expression table across the entire collection. Normalized tags 
per million (TPM) were then calculated using the relative log expres- 
sion (RLE) method in edgeR**. Almost all peaks (96%) were repro- 
ducibly detected above 1 TPM in at least two samples, but most were 
detected in less than half the samples. Examining the distribution of 
expression level and breadth across the collection, we classified the 185K 
robust human peak expression profiles as non-ubiquitous (cell-type- 
restricted, 80%), ubiquitous-uniform (‘housekeeping’, 6%) or ubiquitous- 
non-uniform (14%) (Fig. 2a, b). We define ubiquitous as detected in 
more than 50% of samples (median >0.2 TPM) and uniform as a less 
than tenfold difference between maximum and median expression. Esti- 
mation using the smaller mouse expression data set or human primary 
cell, cell line or tissue data subsets resulted in different fractions, yet in 
all cases ubiquitous-uniform expression profiles were in the minority 
(Extended Data Fig. 3a—e). Alternative measures such as richness index 
and Shannon entropy confirm that only a minor fraction of transcripts 
can be considered as genuine housekeeping genes with broad and uniform 
expression (Supplementary Note 4 and Supplementary Table 4 for a 
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list of housekeeping genes). In addition many of the 1,225 known genes 
that were missed in the collection are known to be specifically expressed 
in cell types that are not easily procured; indicating that even more of 
the mammalian transcriptome has a cell-type-restricted expression 
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Figure 2 | Cell-type-restricted and housekeeping transcripts encoded in the 
mammalian genome. a, Density plot summarizing the distribution of relative 
log expression (RLE) normalized maximum and median TPM expression 
values for the 185K robustly detected human peaks identified by FANTOM5 
(colour bar on right indicates relative density). Box and whiskers plots above 
and to right show distribution of median and maximum values in the data 
set (box shows the interquartile range). Promoters of named genes are 
highlighted to show extremes of expression level and expression breadth, note 
the alternative promoters of IRF9 and TRMT5 have different maximums and 
breadths of expression (see Extended Data Fig. 10). Fraction on left of the 
red vertical dashed line corresponds to peaks detected in less than 50% of 
samples with non-ubiquitous (cell-type-restricted) expression patterns 
(median <0.2 TPM). Fraction below the red diagonal dashed line corresponds 
to ubiquitous-uniform (housekeeping) expression profiles (maximum 

<10X median). Fraction above diagonal and to the right of the vertical dashed 
lines corresponds to ubiquitous-non-uniform expression profiles (maximum 
>10X median). b, Box and whisker plots showing the distribution of 
expression levels for the same peaks as in a across the 889 samples (box shows 
the interquartile range). 
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pattern (Supplementary Note 3). In overview, the data confirm the argu- 
ment that most genes are regulated in a tissue-dependent manner’®. 
According to Gene Ontology enrichment analysis’ of genes within 
each of the three classes (Supplementary Table 5), the non-ubiquitous 
genes were enriched for proteins involved in cell-cell signalling, plasma 
membrane receptors, cell adhesion molecules and signal transduction, 
whereas genes in the housekeeping set were enriched for components of 
the ribonucleoprotein complex and RNA processing. The ubiquitous- 
non-uniform set was enriched for cell cycle genes, with 204 of the 268 
human genes annotated with the ‘mitotic cell cycle’ term, a reflection of 
the fact that the fraction of actively proliferating cells inevitably varies 
greatly across the collection. 

Finally, of the 104,859 peaks expressed at 10 TPM (~3 copies per 
cell’*) or greater, an average primary cell sample expressed a median of 
8,757 including peaks for 430 transcription factor mRNAs (Extended 
Data Fig. 3f, g). 


Promoter conservation between human and mouse 


Regulatory regions such as transcription factor binding sites are often, 
but not always, located in conserved and orthologous regions’”. Overall 
human TSSs were significantly enriched in evolutionarily conserved 
regions compared to the genome-wide null expectation, with 38% over- 
lapping previously defined mammalian constrained elements (Fisher’s 
exact test, odds ratio 10.2, P value < 2.2 X 10 '® see Supplementary 
Methods). Despite this general level of conservation, there is evidence 
of extensive evolutionary remodelling of transcription initiation. For 
example, 43% (79,670 out of 184,476) of human TSSs could not be aligned 
to the mouse genome, and 39% (45,926 out of 116,277) of mouse TSSs 
could not be aligned to the human genome (Supplementary Methods). 
Alignment between species decayed as a function of neutral sequence 
divergence (Fig. 3). Housekeeping TSSs showed highest TSS conser- 
vation, whereas the TSSs of non-coding RNAs were less conserved than 
those of protein-coding TSSs. Indeed, the alignment of promoters of 
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Figure 3 | TSS conservation as a function of expression properties and 
functional annotation. a, b, Human robust TSS coordinates were projected 
through EPO12 whole genome multiple sequence alignments (Supplementary 
Methods). The y-axis values show the fraction of human TSSs that align to an 
orthologous position in the indicated species. The x axis shows the relative 
divergence of macaque, dog and mouse genomes as the substitution rate at 
fourfold degenerate sites in protein coding sequence. The TSS locations were 
genome permuted (Supplementary Methods) and then projected through 
EPO12 alignments to give the null expectation (dashed blue line). The 95% 
confidence intervals of 1,000 samples of 1,000 TSS are shown (blue shading). 
a, TSS mapped to the 5’ ends of protein coding and non-coding transcripts are 
labelled (C and N, respectively), those that do not map to a known transcript 
5’ end are shown as the ‘anonymous’ category. With the exception of 
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broadly expressed non-coding transcripts was not greatly different 
from randomly selected genomic sites (Fig. 3a). However, it is impor- 
tant to note that the random permutations inevitably overlap constrained 
elements, so cannot be considered representative of neutral evolution. 

TSSs that were highly-restricted or biased in their expression to a 
single cell type or tissue were more likely to be gained or lost through 
evolution (Fig. 3a). TSSs preferentially expressed in fibroblasts, chon- 
drocytes and pre-adipocytes were among the most conserved, whereas 
those enriched in T-cells, macrophages, dendritic cells, whole blood and 
endothelial cells were the most likely to be gained or lost (Fig. 3b). This 
suggests a more rapidly evolving immune system. It also suggests con- 
tributions of relaxed constraint and positive selection to the remodelling 
of transcription initiation through the insertion and deletion of pro- 
moter sequences. 

To enable comparative analysis, we projected the expression patterns 
from one species to the other (Extended Data Fig. 4) and provide the 
peak position and orthologous expression profile through a cross-species 
track in ZENBU™. Only 54% and 61% of human and mouse conserved 
TSSs (of protein coding genes) had an orthologous peak in the other 
species. This increased to 61% and 63% respectively for TSSs from well 
matched samples (for example, human and mouse hepatocytes), how- 
ever, surprisingly, almost 40% of conserved TSS do not appear to be 
used even in the matched cells (Supplementary Table 6). 


Features of cell-type-specific promoters 

Carrying out a systematic de novo motif discovery analysis in cell-type- 
specific promoters, recovered motifs similar to the binding motifs of 
transcription factors known to be relevant to the corresponding cel- 
lular states (Extended Data Fig. 5a—c and described in Supplementary 
Note 5). Examining general promoter features many CpG island (CGI) 
based promoters (54%) and most non-CGI-non-TATA promoters (92%) 
had non-ubiquitous expression profiles (Extended Data Fig. 3k-n). 
Although CGI promoters are generally associated with housekeeping 
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anonymous, all robust TSSs represented in both panels are associated with the 
5’ ends of previously annotated transcripts. Non-ubiquitous (cell-type- 
restricted), ubiquitous-uniform (housekeeping) and non-uniform-ubiquitous 
were defined as in Fig. 2. Ultra-housekeeping TSSs were defined as those with 
less than fivefold difference between maximum and median. The category top 
1000 UDE represents the 1,000 ubiquitous TSSs that are most differentially 
expressed’. There are 1,016 ultra-housekeeping TSSs, 276 ubiquitous-uniform 
non-coding TSSs and all other categories contain over 2,000 TSSs. b, Same 
axes as panel a showing TSSs with expression that is biased towards a single 
expression facet (larger mutually exclusive grouping of the primary cell and 
tissue samples based on the sample ontologies CO and UBERON, defined in 
ref. 4). Only expression facets with greater than 250 enriched TSSs are shown. 
For clarity, only a subset of expression facets are coloured and labelled. 
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genes, we observed a subset with highly cell-type-restricted expression 
profiles (right tail of Extended Data Fig. 6a). Examining CGI and non- 
CGI promoters separately we find that cell-type-specific promoters of 
both classes were enriched for binding of cell-type-specific transcription 
factors (evidenced by over-representation of motifs and bound sites in 
public ChIP-seq data sets). For the human hepatocellular carcinoma 
cell line HepG2 we observed enrichment of liver-specific transcription 
factors (HNF4, FOXA2, and TCF7L2) at both CGI and non-CGI HepG2 
specific promoters (Extended Data Fig. 6b, c; similar examples are shown 
in Extended Data Figs 5d and 7). As noted in the accompanying analysis’, 
both cell-type-specific CGI and non-CGI promoters tend to have prox- 
imal high-specificity enhancers (Extended Data Fig. 6d). This indicates 
that specific expression at CGI promoters uses the same type of signals 
as non-CGI promoters: proximal transcription factor motifs and high- 
specificity enhancers. 

Of note, a small number of highly abundant RNAs account for 20% 
or more of the reads in some libraries: HBB, SMR3B, STATH, PRB4, 
CLPS, HTN3, SERPINA1, CTRB2, CPB1, CPA and MALAT1. Although 
the abundance of these transcripts is a function of their relatively stability 
as well as rate of initiation, a modest but significant over representation 
of ETS and YY1 sites was found in highly expressed promoters com- 
pared to weakly expressed ones (Extended Data Fig. 5g). Although the 
different motif composition may contribute to expression levels, the 
accompanying manuscript* shows that arrays of enhancers with similar 
usage” probably contribute to the higher maximal expression rate. 


Key cell-type-specific transcription factors 


Among 1,762 human and 1,516 mouse transcription factors compiled 
from the literature’, promoter level expression profiles for 1,665 human 
transcription factors (94%) and 1,382 mouse transcription factors (91%) 
were obtained (Supplementary Tables 7, 8 and 9 and Supplementary 
Note 6). The distribution of expression levels and cell-type or tissue- 
specificity of transcription factors (Extended Data Fig. 3f-j) and the 
number of robust promoter peaks per transcription factor gene was 
similar to coding genes in general (4.8 compared to 4.6). In any given 
primary cell type, a median of 430 (306 to 722) transcription factors 
were expressed at 10 TPM or above (~3 copies per cell based on 300,000 
mRNAs per cell'*) (Extended Data Fig. 3g). 

Clustering transcription factors by expression profile revealed sets 
of transcription factors specifically enriched in each cell type (Extended 
Data Fig. 8). For each primary cell sample we have made available ranked 
lists of transcription factors based on their promoter expression in the 
sample relative to the median across the collection (http://fantom.gsc. 
riken.jp/5/sstar/Browse_samples). For most cell types we found one 
transcription factor that was very highly enriched (= 100-fold), 23 highly 
enriched transcription factors (= tenfold) and 82 moderately enriched 
transcription factors (= fivefold) (numbers of transcription factors 
are based on median number of transcription factors observed at each 
enrichment threshold across the primary cell samples). To demonstrate 
their likely relevance we systematically reviewed phenotypes of tran- 
scription factor knockout mice at the MGI (see Supplementary Note 7). 
The clear connection between tissue-specific expression profiles and 
relevant knockout phenotypes is summarized in Supplementary Table 10. 
For example, in mouse inner ear hair cells, knockout of six of the top 20 
most enriched transcription factor genes in mouse (Pou3f4 (ref. 24), 
Sox2 (ref. 25), Egr2, Six1 (ref. 26), Fos’’, Tbx18 (ref. 28)) as well as patient 
mutations in a further four top transcription factor genes (POU4F3 
(ref. 29), ZIC2 (ref. 30), SOX10 (ref. 31), FOXF2 (ref. 32)) resulted in 
hearing-related defects. Similarly, mouse knockouts or patients with 
mutations in the transcription factors enriched in osteoblasts (CREB3L1 
(ref. 33), DLX5 (ref. 34), EBF2 (ref. 35), HAND2 (ref. 36), HOXCS (ref. 37), 
NFIX**, PRRX1 (ref. 39), PRRX2 (ref. 40), SIX] (ref. 41), TWIST] (ref. 42), 
SHOX”, Six2 (ref. 44)) had bone and osteoblast phenotypes. A sub- 
stantial fraction of top transcription factors (61% of mouse and 40% of 
human transcription factors) have relevant phenotypes recorded in 
knockout mice (Supplementary Table 10). 
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Inferring function from expression profiles 


Taking a pair-wise Pearson correlation matrix of the promoter expres- 
sion profiles we carried out MCL clustering* (Supplementary Methods) 
to group promoters that share similar expression profiles across the 
atlas. Figure 4 shows a graphical overview of the structure of the data 
(and the mouse counterpart is shown in Extended Data Fig. 9). We find 
6,030 cases of named genes with alternative promoters participating in 
two or more coexpression clusters (Extended Data Fig. 10). To evaluate 
and annotate these coexpressed groups, we tested for enrichment in 
specific Gene Ontology terms and in a curated database of 489 biolog- 
ical pathways. Of these, 356 pathways (174 KEGG, 114 WikiPathways, 
46 Reactome, 22 Netpath) were significantly enriched in at least one 
human coexpression group (FDR <0.05). Using this approach, 38% of 
the unannotated robust peaks (35,082 out of 91,269) were within a clus- 
ter with a significant association to a pathway. The annotated coexpres- 
sion groups are summarized in the website (http://fantom.gsc.riken.jp/ 
5/sstar/Browse_coexpression_clusters) and a detailed example identi- 
fying genes putatively involved in influenza A pathogenesis is shown in 
Extended Data Fig. 10a. 

Introducing sample ontology enrichment analysis (SOEA), we show 
that expression profiles can also be associated with cell, anatomical and 
disease ontology terms by testing for overrepresentation of terms in 
ranked lists of systematically annotated samples expressing each peak 
(Extended Data Fig. 11 and Supplementary Methods). Novel peaks can 
be annotated in this way. For example, an un-annotated DPI peak at 
hg19::chr18:3659943..3659972,+ is linked to the terms classical mono- 
cyte (CL:0000860; P value = 6.35 X 10 174 Extended Data Fig. 11h) and 
bone marrow (UBERON:0002371; P value = 2.7 X 10 °°). Manual 
examination of the profile confirms the transcript is predominantly 
expressed in myeloid cells with higher levels in CD14* monocytes. 
Applied to all CAGE peaks, 127,645 human and 44,449 mouse robust 
peaks were annotated as enriched in at least one CL, DOID or UBERON 
term (Extended Data Fig. 11i, j). The most commonly-enriched terms 
at a P value threshold of 10° 7° were classical monocyte (CL:0000860; 
26,634 peaks, 14%), bone marrow (UBERON:0002371; 22,387 peaks, 
12%) and neural tube (UBERON:0001049; 20,484 peaks, 11%) (Sup- 
plementary Table 13). This is consistent with the coexpression cluster- 
ing in Fig. 4 (green and purple spheres correspond to leukocyte and 
central nervous system enriched expression profiles) and indicates that 
a large fraction of the mammalian genome is dedicated to immune and 
nervous system specific functions. 


Conclusion 


The FANTOMS promoter atlas is a natural extension of earlier maps 
of active transcripts and promoters complementing the sequencing of 
mammalian genomes**”. It represents an advance in an order of mag- 
nitude in the wide range of cell types and the amount of data produced 
per sample, and using single-molecule sequencing avoided polymer- 
ase chain reaction (PCR), digestion and cloning bias**. We have iden- 
tified and quantified the activity of at least one promoter for more 
than 95% of annotated protein-coding genes in the human reference 
genome; only the activity of 1,225 promoters remains uncharacterized. 
Some of these may not actually be expressed. Some cannot be unam- 
biguously measured with CAGE due to copy number variants or closely 
related multigene families. The remaining promoters are probably 
expressed in rare cell types or during windows of development or states 
of cellular activation that are not readily accessible and remain to be sam- 
pled. A continued effort to add profiles from these cells will make it pos- 
sible to integrate them with the FANTOMS data, and to extract metadata 
to identify those regulatory elements that are new and lineage-specific. 

The FANTOMS data highlights the value in profiling primary cells 
as opposed to whole tissues. It also highlights the weakness of using 
cancer cell lines. The cancer cell lines generally fail to cluster in a sample- 
to-sample correlation graph with their supposed cell type or tissue 
of origin (Extended Data Fig. 12) and express more transcription fac- 
tors than primary cells (Extended Data Fig. 3g). The mutations and 
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Figure 4 | Coexpression clustering of human promoters in FANTOMS. 
Collapsed coexpression network derived from 4,882 coexpression groups 
(one node is one group of promoters; 4,664 groups are shown here) derived 
from expression profiles of 124,090 promoters across all primary cell types, 
tissues and cell lines (visualized using Biolayout Express® D (ref. 45), r>0.75, 
MCLi = 2.2). For display, each group of promoters is collapsed into a sphere, 
the radius of which is proportional to the cube root of the number of promoters 


chromosomal rearrangements that occur in cancer result in unique 
transcriptional networks that do not exist in the untransformed state 
and do not necessarily generalize across multiple tumours of the same 
type. In terms of building mammalian transcriptional regulatory net- 
work models that reflect the normal untransformed state, primary cells 
are the logical choice. They have normal genomes, and express in the 
order of 430 transcription factors at appreciable levels, ranking of which 
can be used to reduce the complexity further and identify key known 
regulators of cellular phenotypes. Focusing on these key regulators and 
motif searching in the corresponding cell-type-specific promoters pro- 
vides the data to build cell-type-specific regulatory network models 
and support a rational approach to identification of drivers required to 
reprogram cells from one lineage to another. Promoter-based expres- 
sion data also has direct practical applications in the interpretation (and 
re-interpretation) of the function of single nucleotide polymorphisms 
(SNPs) in genome-wide association studies (GWAS), which commonly 
occur in non-coding sequences. In accompanying manuscripts, reana- 
lysis of several GWAS data sets uncovered new disease associations in 
FANTOMS5S promoters and identification of regulatory SNPs within 
enhancers that were active in medically relevant samples (ref. 4 and man- 
uscript in preparation). Accordingly, the data will enable the design of 
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in that group. Edges indicate r > 0.6 between the average expression profiles of 
each cluster. Colours indicate loosely-associated collections of coexpression 
groups (MCLi = 1.2). Labels show representative descriptions of the dominant 
cell type in coexpression groups in each region of the network, and a selection 
of highly-enriched pathways (FDR <10° “) from KEGG (K), WikiPathways 
(W), Netpath (N) and Reactome (R). Promoters and genes in the coexpression 
groups are available online at (http://fantom.gsc.riken.jp/5/data/). 


genotyping arrays and sequence-capture systems to target regulatory 
variation, and the design of promoter constructs allowing researchers 
to specify the cell-type-specificity and absolute expression levels of their 
constructs (particularly for Cre-conditional knockouts” and gene ther- 
apy vectors”°). In all these respects, the FANTOM5S data set greatly 
extends the data generated by ENCODE’ to further our knowledge of 
genome function. 


METHODS SUMMARY 
All Methods are described in full in the Supplementary Information. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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A Sedna-like body with a perihelion of 80 


astronomical units 


Chadwick A. Trujillo’* & Scott S. Sheppard?* 


The observable Solar System can be divided into three distinct 
regions: the rocky terrestrial planets including the asteroids at 0.39 
to 4.2 astronomical units (AU) from the Sun (where 1 AU is the mean 
distance between Earth and the Sun), the gas giant planets at 5 to 
30 au from the Sun, and the icy Kuiper belt objects at 30 to 50 au 
from the Sun. The 1,000-kilometre-diameter dwarf planet Sedna was 
discovered ten years ago and was unique in that its closest approach 
to the Sun (perihelion) is 76 Av, far greater than that of any other 
Solar System body’. Formation models indicate that Sedna could be 
a link between the Kuiper belt objects and the hypothesized outer 
Oort cloud at around 10,000 au from the Sun’ *. Here we report the 
presence of a second Sedna-like object, 2012 VP 13, whose perihelion 
is 80 Au. The detection of 2012 VP,,3 confirms that Sedna is not an 
isolated object; instead, both bodies may be members of the inner 
Oort cloud, whose objects could outnumber all other dynamically 
stable populations in the Solar System. 

The inner Oort cloud objects probably formed on nearly circular 
orbits, allowing them to accumulate mass efficiently*’, and were later 
perturbed into the eccentric orbits we see today. We define an inner 
Oort cloud object as a body whose orbit is not readily formed with the 
known mass in the Solar System. This typically means a perihelion 
greater than 50 au (beyond the range of significant Neptune inter- 
action) and a semi-major axis in the range 150 AU< a < 1,500 AU. At 
above 1,500 Au objects may be considered to be in the outer Oort cloud, 
as galactic tides start to become important in the formation process”®. 
Sedna and 2012 VP,,3 are the clearest examples of inner Oort cloud 
objects because they do not interact significantly with any of the known 
planets owing to their extremely distant perihelia (Fig. 1 and Table 1). 

There are two main models for inner Oort cloud object formation. 
In the first model, planet-sized object(s) in the outer Solar System may 
perturb objects from the Kuiper belt outward to inner-Oort-cloud 
orbits. These planet-sized objects could either remain (unseen) in 
the Solar System*”’ or have been ejected from the Solar System during 
the creation of the inner Oort cloud’”. In the second model, close stellar 
encounter(s) can create the inner Oort cloud objects, possibly within 
the first ten million years (Myr) of the Sun’s life, when it resided within 
its birth cluster'**"'!*"'”. The outer Oort cloud objects could be created 
from galactic tides'*, but the more tightly bound inner Oort cloud 
objects are difficult to create without a close stellar encounter”’. A third, 
less explored, model is that the inner Oort cloud objects are captured 
extrasolar planetesimals lost from other stars that were in the Sun’s 
birth cluster”*”®. Each theory of inner Oort cloud object formation pre- 
dicts different orbital configurations for the population. Therefore, as 
more inner Oort cloud objects are discovered, their orbits will provide 
strong constraints on the inner Oort cloud object formation models 
and thus our Solar System’s evolution. 

To place constraints on the inner Oort cloud object population, we 
constructed a simple simulation of the observational biases affecting 
our survey (Methods). We find three basic results from this analysis: (1) 
there appears to be a paucity of inner Oort cloud objects with perihelia 
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Figure 1 | Sedna and 2012 VP, 3 are clear dynamical outliers in the Solar 
System. Eccentricity versus perihelion distance for the approximately 1,000 
minor planets with well-determined (multi-year) orbits beyond 10 au are 
depicted. There are no known objects with closest-approach distances 
(perihelia, q) between 55 Au and 75 Au, even though such objects would be 
closer, and should therefore be brighter and easier to detect than Sedna or 
2012 VP},3. This suggests there may be a paucity of inner Oort cloud objects 
with q< 75 au. The perihelia of Sedna and 2012 VP,,3 are much too distant 
from Neptune (30 Av) for their existence to be explained by the known mass in 
the Solar System. All error bars are smaller than the data symbols. 


in the range 50-75 AU, suggesting that the inner Oort cloud object popu- 
lation could increase with q > 75 Au; (2) the existence of 2012 VP113 
means that the inner Oort cloud object population must reach down to 
fairly small semi-major axes of about 250 au; and (3) there are no obser- 
vational biases that can explain the clustering of the argument of peri- 
helion (c) near 340° for inner Oort cloud objects and all objects with 
semi-major axes greater than 150 au and perihelia greater than Neptune. 

Although our survey was sensitive to objects from 50 AU to beyond 
300 Au, no objects were found with perihelion distances between 50 Au 
and 75 Au, where objects are brightest and easiest to detect. This was 
true for the original survey that found Sedna’ and the deeper follow-up 
survey~’. Figure 2 illustrates this—most simulated objects (and the real 
objects Sedna and 2012 VP,;3) are found near the inner edge of the popu- 
lation. If the inner Oort cloud objects had a minimum perihelion of 50 AU 
and followed a size distribution like that of the large end of all known 
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Table 1 | The orbital elements of the inner Oort cloud objects 


Object gq (AU) a (AU) e i (deg) Q (deg) « (deg) tp (JD) Epoch (JD) 
2012 VP113 80 266 0.698 24.051 90.806 293 2,443,300 2,455,857.1 
Sedna 76.23 542 0.8593 11.92840 144.466 311.01 2,479,197 2,456,400.5 


The barycentric elements for 2012 VP;,3 and the heliocentric elements for Sedna (from the NASA Jet Propulsion Laboratory Horizons tool for dwarf planets http://ssd.jpl.nasa.gov/?horizons) are given as semi- 
major axis (a), eccentricity (e), inclination (/), longitude of ascending node (@), argument of perihelion («), and perihelion distance (q), which has been derived from a and e. Time of perihelion (t,) and epoch are 
specified in terms of the Julian date (JD). The precision to which quantities are known is reflected in the number of significant digits. We define inner Oort cloud objects as having q > 50 auand 150Au<a< 1,500 au 


(see main text). 
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Figure 2 | Results from the observational bias simulation. The simulated 
inner Oort cloud objects (dots) are shown, along with the simulated detections 
in our survey (blue crosses) and the most sensitive all-sky survey so far, which 
re-detected Sedna” (red crosses) for two different inner Oort cloud object 
models: our favoured model (a) and a comparison model (b). We have 
displayed ten simulated survey realizations per survey. The true values for 
Sedna and 2012 VP;;3 are shown as red and blue circles, respectively, both with 
error bars much smaller than the plot symbol (there are no error bars associated 
with all other symbols, which describe simulated data). There is one difference 
between the two models, the minimum perihelion for the underlying 
population (75 Au in a versus 50 Au in b). Our observational results favour the 
model in a because both Sedna and 2012 VP,,3 were found within a few AU of 
their perihelion. This indicates that the inner Oort cloud objects may have 
increasing numbers with increasing distance. 


472 | NATURE | VOL 507 | 27 MARCH 2014 


small-body reservoir distributions””* (q' = 5), there would be only a 1% 
chance of finding 2012 VP,,3 and Sedna with perihelion greater than 
75 AU and no objects with perihelion less than 75 au. Therefore, we 
conclude that there are few (although probably not zero) inner Oort 
cloud objects in the 50-75 Au region. Some stellar encounter models 
that include the capture of extrasolar material predict a strong inner 
edge to the perihelion distribution of objects, which is consistent with 
our observations”. 

Our simulation suggests that the inner Oort cloud contains about 
1/80th of an Earth mass of material for our favoured q' = 5 model, al- 
though this can vary based on the assumed size distribution (Methods 
and Extended Data Table 1). This is similar to that of the Kuiper belt 
objects (KBOs), which have about 1/100th ofan Earth mass of material”. 
We estimate that about 430+ 593(1¢) inner Oort cloud objects brighter 
than r= 24.3 remain undiscovered throughout the sky. In addition, 
900+ 859 (1c) bodies are expected with diameter larger than 1,000 km 
(most of which are extremely distant), which is statistically consistent 
with detailed model estimates based only on the discovery of Sedna’’. 
Our simulations assume that the inner Oort cloud objects are longitu- 
dinally symmetric, but it is interesting that both 2012 VP),3 and Sedna 
were found at similar sky locations. If further inner Oort cloud object 
discoveries prove this effect to be statistically significant, this would 
strongly constrain inner Oort cloud object formation models. 

Both Sedna and 2012 VP)3 have similar arguments of perihelion 
(@ = 311° and 293°, respectively). The orbital element « describes the 
angle between the point of perihelion and where a body’s orbit crosses 
the celestial plane. Surprisingly, this « similarity is shared for all known 
objects with semi-major axes greater than 150 Au and perihelia greater 
than that of Neptune (w ~ 340° + 55°; see Fig. 3, Extended Data Fig. 1 
and Extended Data Table 2). These extreme scattered disk objects may 
have a similar origin to Sedna or 2012 VP},3 but their much lower peri- 
helia (q < 49 av) allow their existence to be explained through resonant 
interactions with Neptune, except for 2004 VN, 12, 2010 GB,74 and pos- 
sibly 2000 CRygs (refs 10, 24-27). We find that this w clustering cannot be 
due to observational bias for two reasons: (1) any bias for @ = 0° is the 
same as the bias for @ = 180°; and (2) the surveys that have found the 
objects with clustered w were often off-ecliptic or all-sky surveys, which 
would not have a bias for either w = 0° or 180° (Methods). Thus, we con- 
clude that the @ clustering is a real effect. 

The Lidov—Kozai effect is the best known dynamical mechanism for 
constraining the ofa minor planet**. This three-body interaction can 
create outer Oort cloud objects with @ preferentially near 0° and 180° 
early in the history of the Solar System when the Sun was still among its 
nascent cohort of perturbing stars”. This cannot explain the w ~ 0° 
trend today, because w circulates owing to the presence of the giant 
planets. By numerically simulating the effect of the known mass in the 
Solar System on the inner Oort cloud objects, we confirmed that inner 
Oort cloud objects should have random @ (Methods). This suggests that 
a massive outer Solar System perturber may exist and restricts co for the 
inner Oort cloud objects. We numerically simulated the effect of a 
super-Earth-mass body at 250 au and found that for inner Oort cloud 
objects librated around 0° + 60° for billions of years (see Extended Data 
Figs 2 and 3). This configuration is not unique and there are many 
possibilities for such an unseen perturber. A super-Earth-mass body 
at 250 AU with very low albedo would be fainter than current all-sky 
survey detection limits, as would larger and more distant perturbers”®. 
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Figure 3 | The argument of perihelion for distant objects clusters about 0°. 
All minor planets with perihelion greater than 30 AU as a function of semi- 
major axis are shown. All bodies with semi-major axis greater than the line at 
150 au show a pronounced concentration near w ~ 0°. Errors on these orbital 
elements are much smaller than the plotted symbols. This figure appears in 
histogram form in Extended Data Fig. 1. 


Although such a perturber could keep the w near 0°, how objects ini- 
tially obtained w ~ 0° and not 180° is unknown. Stochastic events, such 
as a single strong stellar encounter or rogue planet could produce such 
asymmetric populations. A strong stochastic encounter could also pro- 
duce the low semi-major axis of 2012 VPj,3. It is possible that the w 
clustering indicates that all q > 30 au and a > 150 au bodies formed by 
a similar process, although bodies on elliptical orbits with q less than 
approximately 45 au can be explained by Neptune interactions. Models 
of galactic tides have difficulty producing inner Oort cloud objects having 
low semi-major axes (like Sedna) in the absence of stellar encounters” 
so the even lower semi-major axis of 2012 VP ,3 makes tides extremely 
unlikely. If the inner Oort cloud objects were created from the grav- 
itational tide within our Sun’s birth cluster, the low semi-major axis of 
2012 VP,;3 constrains the birth cluster to have been a moderate 10,000 
solar masses per cubic parsec or more, depending on the cluster model 
assumed’>”°. The moderately red optical colours observed for 2012 VPi13 
(Methods and Extended Data Table 3) are consistent with formation 
in the gas giant region and not in the ultra-red dominated classical 
Kuiper belt. Assuming a moderate albedo, 2012 VP,,3 is 450km in 
diameter, large enough to be considered a dwarf planet ifit is composed 
largely of ice. 


METHODS SUMMARY 


For this work, we conducted two main observational investigations. First, we sur- 
veyed 52 square degrees of sky for new inner Oort cloud objects using the Dark 
Energy Camera (DECam) at the Cerro Tololo Inter-American Observatory (CTIO) 
4-m telescope. Each patch of sky was imaged three times with 1.5 h to 2h between 
images, giving us the ability to detect the motion of objects beyond 300 Au when 
compared to background stars. Analysis software aided our search for moving 
objects. After its discovery at the CTIO 4-m telescope, we measured the position 
and colour of 2012 VP,,; using the Magellan 6.5-m telescope, completing the data 
collection portion of our project. We then created an observational bias simulation 
that simulated our discovery process to place constraints on the underlying number 
and orbital distribution of inner Oort cloud objects. This produces order-of-magnitude 
results for the number of inner Oort cloud objects whose basic parameters are 
poorly constrained because so few are known. Finally, after identifying the unusual 
nature of the @ distribution, we simulated the inner Oort cloud objects and the other 
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minor planets with semi-major axes greater than 150 Au and perihelion greater 
than 30 Au to explore the long-term behaviour of @ for different hypothetical 
planetary masses in the Solar System. These methods are described in greater detail 
in the Methods section. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Atomic and molecular samples reduced to temperatures below one 
microkelvin, yet still in the gas phase, afford unprecedented energy 
resolution in probing and manipulating the interactions between 
their constituent particles. As a result of this resolution, atoms can 
be made to scatter resonantly on demand, through the precise con- 
trol of a magnetic field’. For simple atoms, such as alkalis, scattering 
resonances are extremely well characterized”. However, ultracold 
physics is now poised to enter a new regime, where much more com- 
plex species can be cooled and studied, including magnetic lanthan- 
ide atoms and even molecules. For molecules, it has been speculated** 
that a dense set of resonances in ultracold collision cross-sections 
will probably exhibit essentially random fluctuations, much as the 
observed energy spectra of nuclear scattering do’. According to the 
Bohigas—Giannoni-Schmit conjecture, such fluctuations would imply 
chaotic dynamics of the underlying classical motion driving the 
collision® *. This would necessitate new ways of looking at the fun- 
damental interactions in ultracold atomic and molecular systems, as 
well as perhaps new chaos-driven states of ultracold matter. Here we 
describe the experimental demonstration that random spectra are 
indeed found at ultralow temperatures. In the experiment, an ultra- 
cold gas of erbium atoms is shown to exhibit many Fano-Feshbach 
resonances, of the order of three per gauss for bosons. Analysis of 
their statistics verifies that their distribution of nearest-neighbour 
spacings is what one would expect from random matrix theory’. The 
density and statistics of these resonances are explained by fully 
quantum mechanical scattering calculations that locate their origin 
in the anisotropy of the atoms’ potential energy surface. Our results 
therefore reveal chaotic behaviour in the native interaction between 
ultracold atoms. 

In the common perception, atoms are regarded as ‘simple’ systems 
in sharp contrast to ‘complex’ molecules, whose behaviour is dictated 
by many (rotational and vibrational) degrees of freedom. The recent 
realization of dipolar Bose-Einstein condensates and Fermi gases of 
magnetic lanthanides’”’ made a novel class of atoms available in the 
ultracold regime. These exotic species, such as erbium (Er), make it pos- 
sible to bridge the enormous conceptual gap between ‘simple’ atoms 
and molecules, potentially providing a natural test-bed with which to 
explore complex scattering dynamics in a controlled environment. The 
rich scattering behaviour of lanthanides has been pointed out in pio- 
neering experiments at millikelvin temperatures'*” and theoretical work 
on cold collisions of atoms with non-zero angular momenta'*”. 

The wealth of intriguing properties of Er, which is the focus of this 
Letter, originates in its exotic electronic configuration. Erbium is a 
submerged-shell atom with electron vacancies in the inner anisotropic 
4f ? shell, which lies beneath a filled 6s” shell. As a consequence, it not 
only has a large magnetic moment, of 7 Bohr magnetons (jg), but also 
has large electronic orbital and total angular momentum quantum 
number, of L = 5 and J = 6, respectively; we note that for bosonic and 
fermionic isotopes the nuclear angular quantum numbers are I = 0 and 


I= 7/2, respectively. Large values of L and J are sources of anisotropy in 
the interatomic interaction. Moreover, the two-body scattering is con- 
trolled by as many as 91 electronic Born—Oppenheimer (B-O) inter- 
action potentials, each potential accounting for a specific orientation of 
J with respect to the internuclear axis (Methods). All B-O potentials 
are anisotropic and include, at large internuclear separations, a strong 
dipole-dipole interaction and anisotropic van der Waals dispersion po- 
tentials. This situation is in contrast to that of conventional ultracold atoms, 
such as alkali-metal atoms, where the scattering is determined mainly 
by the isotropic singlet and triplet B-O potentials’. Recent theoretical 
work predicted the existence of anisotropy-induced Fano-Feshbach 
resonances in magnetic lanthanides’*. This greater complexity brings 
significant new challenges in understanding and exploiting scattering 
processes. 

Our experimental study is based on high-resolution trap-loss spec- 
troscopy of Fano-Feshbach resonances in an optically trapped ultra- 
cold sample of Er atoms in the energetically lowest magnetic Zeeman 
sublevel. We prepare the ultracold sample by following a similar cool- 
ing and trapping approach to that described in ref. 12 for bosons and 
ref. 13 for fermions (Methods Summary). After the preparation pro- 
cedure, the ultracold sample typically contains about 10° atoms at a 
temperature of around 400 nK. High-resolution trap-loss spectroscopy 
consists of many experimental cycles. In each cycle, we ramp the mag- 
netic field to a target value, B, and hold the atoms for 400 ms in the 
optical dipole trap, during which time they undergo elastic and inel- 
astic collisions. To probe the loss of atoms from the trap, we record the 
atom number by applying standard time-of-flight absorption imaging 
at zero magnetic field. In the next experimental cycle, we vary the mag- 
netic field value from 0 to 70 G in steps of a few milligauss and repeat 
the measurement. Figure 1 shows the respective loss spectra for '°*Er 
and '°°Er. For both isotopes, we observe very many resonant loss fea- 
tures, which we interpret as being caused by Fano-Feshbach resonances”. 
We identify 190 resonances for 168Fr and 189 resonances for }°°Er, 
meaning that we observe about 3 resonances per gauss. We performed 
similar spectroscopic measurements with the fermionic isotope '°’Er, 
revealing a much higher density of resonances that exceeds 20 reso- 
nances per gauss (Extended Data Fig. 1). The fermionic case is com- 
plicated by its additional hyperfine structure and detailed studies will 
be subject of future work. 

The very high density of resonances in Er is without precedent in 
ultracold quantum gases. For comparison, the density of resonances 
observed in experiments with ultracold alkali-metal atoms or even mix- 
tures is about two orders of magnitude lower than that in Er (compare 
with refs 19, 20). In Er, it is unclear whether a quantitative mapping of 
the observed resonances is possible at all. In principle there are at least 
91 unknown parameters, corresponding to the phase shifts introduced 
by the B-O potentials'*. Instead, we focus our theoretical analysis on 
fundamental issues, such as whether the observed density of reso- 
nances can be reproduced by microscopic calculations, and whether 
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Figure 1 | Fano-Feshbach spectrum of '°*Er and ‘Er from 0 to 70G. The 
trap-loss spectroscopy is performed in an optically trapped sample of Er atoms 
in the energetically lowest Zeeman sublevel, m; = —6, at a temperature of 
330nK. The atom number is measured after a holding time of 400 ms. We 


our results imply the presence of highly anisotropic interactions, which 
call into play resonant states of high orbital momentum. We answer 
these questions in the affirmative using full coupled-channel calcula- 
tions, supported by an analytical model. 

We construct a first-principles coupled-channel model for Er + Er 
scattering to calculate the spectrum of Fano-Feshbach resonances. Fol- 
lowing ref. 18, our model uses the atomic basis set and Hamiltonian 
(Methods) that includes the radial kinetic and rotational energy opera- 
tors, the Zeeman interaction and the 91 anisotropic B-O potentials. 
For small interatomic separations, R, the B-O potentials are calculated 
using the ab initio relativistic, multi-reference configuration—interaction 
method”'. At intermediate to large values of R, the B-O potentials are 
expressed as a sum of multipolar interaction terms. The van der Waals 
dispersion interaction potentials ( x 1/R°) are determined from experi- 
mental data on atomic transition frequencies and oscillator strengths”’. 
Animportant point is that the dispersion potentials have both isotropic 
and anisotropic contributions. The latter comes from the non-S state 
character of the Er electronic ground state. The B-O potentials thus 
induce either isotropic (€ and m¢ conserving) or anisotropic (f or me 
changing) couplings. Here, € and me are the partial-wave quantum 
number and its projection on the magnetic field quantization axis. 

We perform coupled-channel calculations for bosonic 168Fr, con- 
sidering s-wave (€ = 0) collisions and couplings to molecular states 
with even € up to Linax = 20. We calculate the elastic collisional rate 
coefficient as a function of magnetic field to obtain the Fano-Feshbach 
resonance spectrum. For Lax = 20, we observe a very dense resonance 
spectrum with about 1.5 resonances per gauss, which qualitatively re- 
produces our experimental observation (Extended Data Fig. 2). We 
note that resonances belonging to incident channels with >0 are 
substantially narrower and do not contribute much to the density of 
resonances. To get deeper insight into the role of the anisotropy of the 
potentials, we calculate the mean density of resonances, p, from our 
coupled-channel calculations for different values of the maximum par- 
tial wave Lax (Fig. 2). For Lax up to 20, we observe that p increases 
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observe 190 Fano-Feshbach resonances for ‘°*Er (a) and 189 resonances for 
'°°Er (b). Resonance positions are extracted by fitting a Gaussian shape to 
individual loss features; a full list is given in Extended Data Tables 1 and 2. 


with Lyax in a quadratic manner. This dependence stands in stark 
contrast to alkali-metal atoms, where high-¢ resonances tend to be 
too narrow to be observed. 

Because our limited computational resources do not allow us to 
perform calculations for Lyax > 20, it is worth estimating the density 
of resonances in a simpler way, based on the separated atom quantum 
numbers’. The key ideas of our model are the following. For each chan- 
nel |j, 71; j21Mj2, me), we define the long-range potential —C,/R° + 
W€(€ + 1)/2uR? + gup(my, + my2)B, where Cg is the isotropic van der 
Waals coefficient of the B-O potentials and / is Planck’s constant 
divided by 27. Here is the reduced mass, g is the atomic g-factor and, 
for ground-state Er, Cs = 1,723 atomic units (a.u.). Fano-Feshbach 
resonances in our open channel with m,; = —6 and mj2 6 are due 
to couplings to the most weakly bound rovibrational level of closed 
channels. For a van der Waals potential’** this bound state has a bind- 
ing energy that must fall within the ¢-dependent energy window [— A¢, 
0] with 4; > 0. The short-range potentials are not accurately known 
and, for each closed channel, there is a probability dE,/4¢ of finding a 
bound state with a binding energy between E, and E, + dE. From ref. 
24 and numerical simulations, we find A¢/E,aw ~ 38.7 + 25.5€ + 3.17€7, 


where Eyaw =" / 2x2 ww and xvaw = 4/ 2uCs / Ie /2. Each closed chan- 
nel contributes gupdm/A¢ to the mean resonance density, where 
ginom > 0 is the magnetic-moment difference of the closed and open 
channels and 6m is their difference in molecular projection quantum 
numbers. Adding the contributions for the closed channels gives the 
total mean resonance density. This simple counting technique, which 
we here name random quantum defect theory (RQDT), yields the mean 
density of resonances shown in Fig. 2. For Lmax < 20, the results of our 
analytic RQDT agrees very well with the exact coupled-channel calcu- 
lations. For larger values of Lyax, the density of resonances keeps 
growing and eventually saturates to a value around p ~ 4G _', which 
is close to the one observed in the experiment. RQDT shows that at 
least 40 partial waves need to be considered to reproduce the experi- 
mental observations. 
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Figure 2 | Mean resonance density for bosonic Er as a function of largest 
included partial-wave Ly,ax- Coupled-channel calculations for L,ax up to 20 
(circles) and RQDT calculation (solid line) for a magnetic field of 0 to 70G. 
For calculations, a collision energy of E = kg X (360 nK) is assumed. The 
mean densities of resonances measured in the experiment are shown for ‘Er 
(dashed line) and for !°°Er (dash—dot line) with 1o confidence bands 
(shaded areas). 


Our microscopic models reproduce well the qualitative behaviour of 
the system. However, given the complexity of the scattering, the ana- 
lysis of ultracold collision data can and should no longer aim to assign 
individual resonances, and the fundamental question of how to tackle 
complex scattering arises. Historically, spectra of great complexity have 
been understood within the framework of random matrix theory (RMT), 
as originally developed by Wigner to describe heavy nuclei containing 
a very large number of degrees of freedom”. This is an alternative view 
of the quantum mechanics of complex systems, where individual energy 
levels and resonances are not theoretically reproduced one by one, yet 
their statistics can be described”*. RMT characterizes spectra by fluc- 
tuations of their energy levels and classifies their statistical behaviour in 
terms of symmetry classes; for example, the Gaussian-orthogonal en- 
semble (GOE) is appropriate in the case of a system with time-reversal 
symmetry, such as neutral atoms. 

Following RMT, the distribution of spacings between neighbouring 
levels (or resonances) characterizes the spectral fluctuations of the 
system and reflects the absence or the presence of level correlations 
in terms of a dimensionless quantity, s = 6B/d, where 5B is the space 
between any two adjacent levels and d= 1/f is the mean level spacing. 
Whereas the nearest-neighbour spacing (NNS) distribution P(s) of 
non-interacting levels is Poissonian, Pp = exp(—s), strongly interact- 
ing levels obey a totally different distribution, which, in the case of 
GOE statistics, is known as the Wigner-Dyson (W-D) distribution, or 
‘Wigner surmise”® 


7 1 
Pwo = Ssexp(—“s*) (1) 


which shows a strong level repulsion for small s: Pwp(0) = 0. The field 
of application of the W-D distribution is so vast as to make it a uni- 
versal feature of very complex systems, such as heavy nuclei, disordered 
conductors, zeros of the Riemann function in number theory, and 
even risk management models in finance**. Remarkably, the Bohigas- 
Giannoni-Schmit conjecture further enriched the field of applications 
of GOE statistics®”’, showing that it applies generally to chaotic systems, 
such as Rydberg atoms in strong magnetic fields and Sinai billiards, 
where only a few degrees of freedom are relevant but where motion 
in these degrees of freedom occurs on a highly anisotropic potential 
energy surface’. Recently, it has been speculated that even cold and 
ultracold atom-molecule collisions will show essential features of GOE 
statistics**. 

Inspired by these works, we statistically analyse both the experimental 
and calculated Fano-Feshbach spectrum according to RMT. To obtain 
the NNS distribution of resonances, we first derive p and the mean 
spacing between resonances, d, by constructing the ‘staircase function”. 
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This step-like function counts the number of resonances below a mag- 


B 
netic field value B and is defined as NV‘ (e)- | dB’ S °5(B—Bi), 
0 i 


where 6 is the delta function and B; is the position of the ith resonance. 
For our experimental data (Fig. 3a), the staircase function shows an 
increase of the number of resonances with B, which is linear for large B 
and flattens out towards lower magnetic field values (Fig. 3b). The 
density of resonances is given by the derivative of the staircase function. 
We evaluate p in the region above 30 G, where the staircase function 
shows a linear relationship (Methods), and we obtain p = 3.0(3) G7! 
and d=0.33(3) G. We perform a similar analysis with '°°Er and find 
that p =3.3(3) G~! and that d=0.31(3) G (Extended Data Fig. 3). For 
data from coupled-channel calculations, we find that p =1.5 G7! for 
Lax = 20 (Fig. 2). Finally, we derive the NNS distribution for the experi- 
mental and coupled-channel data by constructing a histogram of res- 
onance spacings. We choose a number of bins of the order of JN. , with 
N being the number of Fano-Feshbach resonances used for analysis”. 
We then rescale the histogram in terms of s and normalize the distri- 
bution to obtain P(s). 

Figure 4 is the main result of our statistical analysis for '°*Er. The 
plot shows the NNS distribution of the experimental and coupled- 
channel Fano-Feshbach resonances, together with the parameter-free 
Poisson and W-D distributions (equation (1)). We see an impressive 
agreement between the experimental result and the coupled-channel 
calculations. Remarkably, both follow a distribution much closer to the 
W-D one than to the Poissonian one. To quantify the agreement with 
the GOE statistics, we evaluate the reduced ¢ statistic, xv, between our 
data and the Poisson and W-D distributions. We find that 7y =0.9 
and 7; =2.3 for our experimental data and that 7{,, =0.8 and 7 = 
3.0 for the data of the coupled-channel calculations. The fact that 7p 
<1 confirms that our data are well described by a W-D distribution. 
Similar results are found for '°°Er (Extended Data Fig. 4). 

To investigate the spectral correlations further, we analyse our data 
in terms of other statistical quantities, such as the number variance and 
the two-gap distribution function” (Methods). The number variance, 
(AB), measures the fluctuations in the number of resonances in a 
magnetic field interval AB (ref. 7; Methods). For uncorrelated (Poisson- 
distributed) levels, X 2 — AB, indicating large fluctuations around a 
mean value. For quantum chaotic systems, the correlations are strong 
and the fluctuations are thus less spread out. In this case, Y? ocln(AB). 
This slower increase of the number variance is regarded as indicating 
the strong ‘spectral rigidity’ of the system’. Our observations clearly 
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Figure 3 | Loss-maxima position and staircase function for ‘Er. 

a, Positions of the measured loss maxima of Fig. 1 are shown as vertical lines. 
b, The staircase function shows a linear dependence on the magnetic field at 
large values. A linear fit to the data above 30 G is plotted in a lighter colour. 
Inset, magnification of the data to emphasize the step-like nature of the 
staircase function. 
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Figure 4 | NNS distribution and number variance. a, !“°Er NNS distribution 
above 30 G with a bin size of 160 mG. The plot shows the experimental data 
(circles) with the corresponding Brody distribution (which interpolates 
between Poisson and Wigner-Dyson distributions) (solid line), the Brody 
distribution for the coupled-channel calculation with L,,, = 20 (short-dashed 
line), and the parameter-free distributions Pp (long-dashed line) and Pwp 
(dash-dot line). The Brody distribution is given in Methods. For the error 
bars in the experimental data, we assume a Poisson counting error. 

b, Number variances for the experimental data (solid line) with a 2o confidence 
band (shaded area), the coupled-channel data (short-dashed line), 

Pp (long-dashed line) and Pwp (dash-dot line). 


deviate from the Poissonian behaviour, showing that 5* tends to the 
W-D case (Fig. 4b), and confirm the presence of correlations in our 
system. 

Our observations reproduce the salient features predicted by GOE 
statistics for chaotic systems, the level repulsion and the spectral rigid- 
ity. This implies a degree of complexity in Er + Er cold collisions unpre- 
cedented in any previous ultracold scattering system. Our results bring 
the powerful analytical tools of quantum chaos to bear’. In particular, 
these approaches connect the large-scale structure of the spectra to 
simple features such as the shortest closed classical orbits in the poten- 
tial energy surface, where these connections are made by the Gutzwiller 
trace formula”’. Identifying the most important closed orbits will then 
shed light on the potential energy surface itself, providing a route to 
describing ultracold collisions that is complementary to the elaborate 
close-coupling calculations, which will be difficult to connect in detail 
with the data. 

Erbium systems are the first in which statistical analyses and chaotic 
behaviour are important to ultracold collisions, but they will not be the 
last. Specifically, much experimental effort is being exerted on produc- 
ing ultracold molecular samples, which also have highly anisotropic 
potential energy surfaces. Learning to understand complex spectra, by 
acknowledging their essentially chaotic nature, represents a turning point 
in how the field will consider ultracold collisions in future and provides 
new inroads into ultracold chemistry. 


METHODS SUMMARY 


For bosonic sample preparation, we follow the approach of ref. 12. We obtain about 
3 X 10° optically trapped atoms at a density of 3 X 10'? cm” *. The trap-loss spec- 
troscopy is performed in a trap with frequencies of (¥,, Vy, Vz) = (65, 26, 270) Hz. 
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The temperature of the cloud is measured by time-of-flight imaging at 0.4 G, which 
yields Ti¢g = 326(4) nK for 168Fr and Ty66 = 415(4) nK for ‘°°Er. We ramp the 
magnetic field to a probe value between 0 and 70 G within 10 ms, and hold the atomic 
cloud in the optical dipole trap for 400 ms. We observe an increase in the temper- 
ature to 560 nK at a magnetic field of about 50 G due to the ramping over many 
Fano-Feshbach resonances. For fermionic sample preparation, we follow the approach 
of ref, 13. We obtain about 1.2 X 10° fermionic atoms at a density of 2 10'*cm7* 
and at a temperature of 0.4T;, where T; = 1.0(1) K is the Fermi temperature. The 
trap frequencies are (v,, Vyp vz) = (427, 66, 457) Hz. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Photosynthesis in the surface ocean produces approximately 100 
gigatonnes of organic carbon per year, of which 5 to 15 per cent is 
exported to the deep ocean’. The rate at which the sinking carbon 
is converted into carbon dioxide by heterotrophic organisms at depth 
is important in controlling oceanic carbon storage’. It remains uncer- 
tain, however, to what extent surface ocean carbon supply meets the 
demand of water-column biota; the discrepancy between known 
carbon sources and sinks is as much as two orders of magnitude*®. 
Here we present field measurements, respiration rate estimates and 
a steady-state model that allow us to balance carbon sources and 
sinks to within observational uncertainties at the Porcupine Abyssal 
Plain site in the eastern North Atlantic Ocean. We find that pro- 
karyotes are responsible for 70 to 92 per cent of the estimated remin- 
eralization in the twilight zone (depths of 50 to 1,000 metres) despite 
the fact that much of the organic carbon is exported in the form of 
large, fast-sinking particles accessible to larger zooplankton. We 
suggest that this occurs because zooplankton fragment and ingest 
half of the fast-sinking particles, of which more than 30 per cent may 
be released as suspended and slowly sinking matter, stimulating the 
deep-ocean microbial loop. The synergy between microbes and zoo- 
plankton in the twilight zone is important to our understanding of 
the processes controlling the oceanic carbon sink. 

The global carbon cycle is affected by biological processes in the oceans, 
which export carbon from surface waters in the form of organic matter 
and store it at depth, in a process called the ‘biological carbon pump’. 
Most of the exported organic carbon is processed by the water-column 
biota, which ultimately convert it into CO, by means of respiration 
(remineralization). Variations in the resulting decrease in organic flux 
with depth’ can, according to models, lead to changes in atmospheric 
CO, of up to 200 p.p.m. (ref. 3), indicating a strong coupling between 
biological activity in the ocean interior and oceanic storage of CO>. 

A key constraint in the analysis of carbon fluxes in the twilight zone 
is that, in the steady state, the attenuation of particulate organic carbon 
(POC) flux with depth should be balanced by community metabolism. 
Published estimates of POC flux attenuation with depth are, however, 
up to two orders of magnitude lower than corresponding estimates of het- 
erotrophic metabolism*”’. This discrepancy indicates either that estimates 
of POC flux, community metabolism, or both, are unreliable, or that addi- 
tional, unaccounted for, sources of organic carbon to the twilight zone exist®. 

We compiled a comprehensive carbon budget of the twilight zone 
on the basis of an extensive programme of field measurements at the 
Porcupine Abyssal Plain (PAP) site (Extended Data Fig. 1a) in July and 
August 2009. This site is located in the transition region between the 
subtropical and subpolar gyres of the North Atlantic’®. The mixed-layer 
depth remained constant at approximately 50 m throughout the study 
period. This depth was subsequently used as the upper boundary of 
the twilight zone, given the need to normalize export measurements to 
dynamic upper boundaries for the twilight zone". 


Organic carbon sources to the twilight zone include sinking part- 
icles, downward mixing of dissolved organic carbon (DOC), lateral 
advection of organic matter from the continental shelf, active transport 
via the daily vertical migration of zooplankton that feed in the mixed 
layer at night and rest at depth during the day, and chemolithoauto- 
trophy (prokaryotic growth using dissolved inorganic carbon and chem- 
ical energy sources). 

The downward flux of sinking particles was measured using simul- 
taneous 48-h deployments of free-drifting, neutrally buoyant sediment 
traps’ at depths of 50, 150, 300, 450 and 600 m (Extended Data Table 1). 
Satellite chlorophyll imagery and horizontal velocities (obtained using a 
150-kHz vessel-mounted acoustic Doppler current profiler) confirmed 
that all of the traps were advected along the edge of an anticyclonic eddy 
for 50 km before surfacing within 3.5 km of each other. The measured 
POC flux at 50 m (84 + 8mgCm *d_') was close to estimates inde- 
pendently derived using ***Th budgets and studies of collected marine 
snow particles’ (99 + 41 and 146 + 26mgCm “d ', respectively). 
Attenuation of POC flux (F) with depth (z) was fitted to the Martin 
curve? (F, = Fyp(z/ MLD)’), where Fyip is the flux at the bottom of 
the mixed layer, MLD is the depth of the bottom of the mixed layer and 
b is the rate of attenuation. The observed attenuation rate (b = —0.70; 
P<0.01, R?=0.95, n= 5) was consistent with observations in the 
Pacific Ocean®"* (b = —0.50 to —1.38; Fig. 1a). Downward POC flux 
was extrapolated to 1,000 m using b = —0.70. The total loss of POC 
within the twilight zone was 74 +9mgCm *d_'. 

DOC input to the twilight zone was estimated to be 15 mg DOC m 7d 
(range, 0.4-30 mg DOC m 7d‘) on the basis both of the ratio between 
DOC concentrations and apparent oxygen utilization’*, and of DOC 
gradients coupled to turbulent diffusivity measured from previous work 
at the study site’® (Methods and Extended Data Fig. 2). DOC was esti- 
mated to supply 17% of total export, in agreement with previous esti- 
mates of 9-20% across the North Atlantic basin’’. Organic matter input 
through lateral advection was assumed to be negligible from analyses of 
back-trajectories (derived from satellite-derived near-surface velocities 
over 3 months) of the water masses arriving at the PAP site during the 
study period, which suggested that the water had not passed over the 
continental slope (Extended Data Fig. 1b). The final source of DOC, 
namely excretion at depth by active flux, was estimated using net sam- 
ples of zooplankton biomass and allometric equations®”’, giving a supply 
of 3mgCm 7d '. Defecation and mortality at depth present further 
sources of organic carbon to the twilight zone, but these were excluded 
from the budget owing to large uncertainties associated with their esti- 
mation. Finally, chemolithoautotrophy has been suggested to be an impor- 
tant source of organic matter in the deep ocean’, but without strong 
evidence that this poorly understood process could provide a major con- 
tribution at our study site, we chose to exclude it from our carbon budget. 

The remineralization of organic carbon by zooplankton and prokary- 
otes was estimated from zooplankton biomass and prokaryotic activity. 
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Figure 1 | Sinks and sources of organic carbon to the twilight zone. a, POC 
flux (black dots) below the mixed layer (shaded area) at the PAP site during 3-6 
August 2009, fitted to the Martin equation (F, = Fs9(z/ 50)’; solid line; 

Fso = 78mgCm *d_', b= —0.70; P< 0.01, R? = 0.95, n =5). The observed 
attenuation is consistent with rates observed in the Pacific”’"* (grey area, 
dotted lines). Error bars show analytical error (s.d.). b, c, Depth profiles of 
respiration by non-migratory zooplankton (ZR) based on biomass samples 
(n = 58; b) and leucine incorporation (j1mol Leu m °d_}) by prokaryotes 
(power-law fit (solid) and interquartile range (dashed); P< 0.001, R? = 0.86, 
n = 37; c). d, The sum of net organic carbon supply (AOG; light grey) of POC, 
DOC and active flux (asterisk) matches respiration by non-migratory 
zooplankton (ZR; dark grey) and prokaryotes (PR; mid grey). Error bars 
represent upper and lower estimates (see text and Table 1). 


It is crucial to note that in a steady-state system, such as we assume this 
to be, organic carbon is lost from the system only by export or by remin- 
eralization. We focus entirely on community respiration as a measure 
of remineralization, which is a fundamental advance over previous 
methods to derive budgets (Methods). 

Zooplankton respiration was estimated by applying allometric rela- 
tionships® to biomass measurements derived from net samples collected 
vertically every 80 m, twice during both day and night, using the ARIES 
net system fitted with 200-t1m cod-ends (the narrow, exchangeable ends 
of the nets, which retain the samples) (Extended Data Table 1 and Extended 
Data Fig. 3). These allometric relationships are well constrained’®, but 
they are based on epipelagic zooplankton and our calculated respira- 
tion rates for the lower mesopelagic are therefore probably overestimates 
of the true rates*’. Zooplankton resident in the twilight zone, mostly 
detritivorous copepods (Oithona and Oncaea) and carnivorous chae- 
tognaths, had combined respiration rates of 15.2 and 12.7mgCm *d ! 
(50-1,000 m), respectively, during the two deployment periods (Fig. 1b). 
Migrating zooplankton (determined as the difference between day and 
night biomasses) were excluded from these estimates because we assume 
that they ingest sufficient carbon during grazing at the surface to satisfy 
their diagnosed respiration rates at depth (Methods). The organic car- 
bon they respire within the twilight zone is thus imported by daily ver- 
tical migration. 

Prokaryotic heterotrophic production was determined using bioassay 
isotope-dilution techniques with *H-leucine tracer”!. Leucine incorpora- 
tion rates were 41.7 + 21.2nmol Leum™*d 7‘ at 150 mand 6.6 + 4.1 nmol 
Leum *d_! at 500-750 m (Fig. 1c), similar to previous estimates in 
the eastern North Atlantic’? (37.7 and 7.5 nmol Leu m °d}, respec- 
tively). Integrated leucine incorporation based on a power-law fit was 
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14.5 mol Leu m?d! (interquartile range, 13.2-16.1 mol Leu m 7d 
P<0.001, R* = 0.86, n= 37). This fit was chosen on the assumption 
that bacterial activity follows the supply of organic carbon”, although 
we lack data from between 50 and 150 m to confirm this fit. The uncer- 
tainty in this interpolation possibly leads to a misestimate of integrated 
leucine incorporation. Integrated leucine incorporation was converted 
into respiration using leucine-to-carbon conversion factors (0.44 + 
0.27 kg C mol‘ Leu) and growth efficiencies (interquartile range, 0.04— 
0.12) specific to the twilight zone derived from thorough literature 
surveys (Methods and Extended Data Fig. 4). The uncertainty in this 
calculation was estimated by bootstrap analysis with 100,000 simula- 
tions. The final estimate for integrated (50-1,000 m) prokaryotic respi- 
ration was71 mg Cm *d_' (interquartile range, 35-152 mgCm 7d). 

The sum of the inputs from POC and DOC matches community res- 
piration (68-116 versus 48-167 mgC m 7 d_'; Fig. 1d), with prokary- 
otes dominating community respiration (70-92%; Table 1). 

Our study successfully reconciles the various components of the car- 
bon budget in the twilight zone of the ocean. This was possible because 
we considered a dynamic upper boundary for the twilight zone (the 
base of the mixed layer), excluded vertical migrators from the estimate 
of zooplankton respiration in the twilight zone, and compared respira- 
tion rather than carbon demand to net organic carbon supply. Depth- 
resolved estimates of supply and consumption (Extended Data Fig. 5) 
show an excess of supply in the upper twilight zone (50-150 m) and a 
deficit in the lower twilight zone (150-1,000 m). We suggest that this 
may be caused by a subtle vertical change in ecosystem structure with 
depth**** or an unaccounted-for vertical transfer of organic carbon 
between the upper and lower twilight zones. 

The suggestion that prokaryotes dominate community respiration 
seems counterintuitive given that organic carbon supply to the twilight 
zone is dominated by sinking particles that are accessible to larger 
(>200 ttm) zooplankton. We therefore propose that one of the main 
roles of zooplankton in the twilight zone is to mechanically degrade 
particulate material” into slow-sinking particulate matter and dissolved 
organic material that is subsequently remineralized by microbes (pro- 
karyotes and their consumers). 

To explore whether this conceptual picture is consistent with our 
present understanding of twilight-zone ecology, and to provide a full 
quantitative picture of the twilight-zone carbon cycle, we used a simple 
steady-state model of that cycle’®. The model traces the turnover and 
remineralization of sinking POC along three pathways: colonization 
and solubilization of detritus by attached microbes, production of free- 
living microbes following loss of solubilization products during particle 
degradation, and consumption by detritivorous zooplankton (Methods 
and Extended Data Fig. 6a). The model was modified to include vertical 
mixing of DOC and active transport as carbon inputs to the twilight 
zone and to represent POC in both sinking and suspended forms, 
the latter produced via zooplankton ‘sloppy feeding” (leakage during 
ingestion). Inputs of carbon to the twilight zone were the measured 
values given in Table 1. 

Modelled respiration rates matched field data well, with 84% of the 
CO, being produced by microbes (prokaryotes and prokaryote con- 
sumers) and only 16% by zooplankton (detritivores and carnivores) 
(Fig. 2). The model further suggests that microzooplankton respiration, 


Table 1 | Carbon budget for the twilight zone (50-1,000 m) 


Input Respiration Community 
respiration (%) 

Sinking POC 74 (65-83) Zooplankton 14 (13-15) 16 (8-30) 
Vertical mixing 15 (0-30) 
(DOC) 
Active transport 3 Prokaryotes 71(35-152) 84(70-92) 
(DOC) 
Lateral advection ) 
Total 92 (68-116) 85 (48-167) _— 


Input fluxes and respiration rates (mgC m~*d~') are based on measurements at the PAP site. 
Numbers in brackets refer to lower and upper estimates (see text). Community respiration was 
estimated by combining highest and lowest estimates. 
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Figure 2 | Predicted carbon cycle in the twilight zone. Organic carbon is 
supplied to the twilight zone as POC and DOC (vertical mixing plus active 
transport) (green arrows). POC is processed by detritivores (50%) or attached 
prokaryotes (50%) and recycled in the twilight zone until eventually 
remineralized (red arrows), whereby prokaryotes dominate respiration (79%). 
Observed rates of carbon supply to the twilight zone and remineralization via 


which had not been measured during the study, has a small role in the 
overall budget, contributing only 5mgCm *d_'. Attached prokar- 
yotes processed half of the POC flux. The remaining half was processed 
by detritivorous zooplankton, which released 30% of it as suspended 
POG, thereby confirming our hypothesis. The relative roles of zoo- 
plankton and prokaryotes in processing and respiring sinking POC are 
robust to changing model parameter values (Methods and Extended 
Data Fig. 7). Moreover, it is consistent with the general perception that 
detritivores are sloppy feeders that ingest <40% of processed particles, 
causing most fast-sinking POC to break up into slow- or non-sinking 
POC and DOC”. This pool of suspended organic matter stimulates the 
microbial loop”* in the twilight zone and ultimately fuels the respira- 
tion of prokaryotes®”*. 

Our results highlight a synergy between zooplankton and microbes 
in the twilight zone, where both are important in processing the organic 
carbon flux and, subsequently, in controlling the strength of the oceanic 
carbon sink. Large uncertainties remain, however, particularly with 
regard to estimating prokaryotic activity. A better understanding of 
prokaryotic metabolism throughout the twilight zone, combined with 
process studies focusing on the upper twilight zone, is necessary to 
understand the biological carbon pump fully. 


METHODS SUMMARY 

We conducted an extensive programme of field measurements at the PAP site 
(49.0 °N, 16.5 °W) from 8 July to 13 August 2009 aboard RRS Discovery. Sinking 
material was collected for 48h using free-drifting, neutrally buoyant PELAGRA 
sediment traps’. Samples were screened to remove swimmers, split into aliquots, 
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respiration (‘CO,’) are displayed in green and red boxes, respectively. 
Internal net flows (mg C m 7 d_') derived from a numerical model are 
represented as arrows (line width reflects flow magnitude; dotted lines are 
flows <0.5 mg Cm *d™'). Fluxes indicated with an asterisk are for 
microzooplankton (prokaryote consumers), which are not included in the 
measured estimates. 


filtered onto pre-combusted glass fibre filters (grade GF/F), fumed with sulphurous 
acid and analysed for POC. DOC input was estimated from data collected near the 
PAP site during May-June 2005 (https://www.bodc.ac.uk/data/published_data_ 
library/catalogue/10.5285/f3b3d4e9-5ede-2824-e044-000b5de50f38/) and October- 
November 2005 (https://www.bodc.ac.uk/data/published_data_library/catalogue/ 
10.5285/f3b3d4e9-5edf-2824-e044-000b5de50£38/). The slope of the correlation 
between measured DOC and apparent oxygen utilization was compared with the 
theoretical slope (Co,g/—O2 = 117/170), giving the relative contribution of DOC 
to heterotrophic respiration’*. A lower estimate was calculated using turbulent dif- 
fusivity measurements at the PAP site’®, coupled with the aforementioned DOC 
profiles. Samples for zooplankton biomass profiles (0-1,000 m at 80-m intervals) 
were preserved in formaldehyde, size-fractionated, identified and enumerated. 
One to fifty individuals from each group at each depth and size fraction were ana- 
lysed for dry weight. Zooplankton respiration (in micrograms of carbon per indi- 
vidual per hour) was estimated as a function of body mass (in milligrams dry weight 
per individual) and temperature® (degrees Celsius). DOC excretion at depth was 
assumed to be equivalent to 31% of respiration by migrating zooplankton’. Leucine 
incorporation rates were estimated on samples (n = 37) recovered from depth 
using a conductivity-temperature-depth rosette sampler. Both time-course exper- 
iments and concentration-series bioassays were carried out. In these respective 
types of experiment, *H-leucine was added at final concentrations of 10-20 nM 
and 0.025-0.5 nM and incubated in the dark at in situ temperatures for 4-8 h and 
0.5-2 h. Samples were filtered onto 0.2-m polycarbonate filters and washed with 
deionized water, and their radioactivity was then measured. 

The cruise metadata report is available from the British Oceanographic Data 
Centre (http://www.bodc.ac.uk/data/documents/cruise/9451/). 
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Geomagnetic fluctuations reveal stable stratification 
at the top of the Earth’s core 


Bruce Buffett! 


Modern observations of the geomagnetic field reveal fluctuations’ * 
with a dominant period of about 60 years. These fluctuations are 
probably a result of waves in the liquid core‘, although the precise 
nature of the waves is uncertain. Common suggestions include a type 
of magnetic wave, known as a torsional oscillation’, but recent studies® 
favour periods that are too short to account for a 60-year fluctuation. 
Another possibility involves MAC waves’, which arise from the inter- 
play between magnetic, Archimedes and Coriolis forces. Waves with 
a suitable period can emerge when the top of the core is stably strati- 
fied. Here I show that MAC waves provide a good description of 
time-dependent zonal flow at the top of the core’, as inferred from 
geomagnetic secular variation’. The same wave motion can also ac- 
count for unexplained fluctuations in the dipole field’®. Both of these 
independent predictions require a 140-kilometre-thick stratified layer 
with a buoyancy frequency comparable to the Earth’s rotation rate. 
Such a stratified layer could have a thermal origin'’”’, implying a core 
heat flow of about 13 terawatts. Alternatively, the layer could result 
from chemical stratification’*™. In either case, the existence of a strati- 
fied layer at the top of the core obscures the nature of flow deeper in 
the core, where the magnetic field is continually regenerated. 

Evidence for a 60-year fluctuation in the geomagnetic field was first 
reported’ more than 40 years ago. Subsequent studies** using longer 
records have confirmed the existence of a dominant 60-year period, 
and find evidence for a weaker 30-year fluctuation. Initial explanations 
appealed to torsional oscillations’, which are nearly geostrophic motions 
in the form of fluid cylinders aligned with the rotation axis. The restor- 
ing force for wave motion is due to the component of the magnetic field 
that threads across the surface of the cylinders. A nominal 60-year period 
implies a relatively weak magnetic field of roughly 0.3 mT (refs 15 and 16). 
Stronger magnetic fields in numerical geodynamo models” imply shorter 
periods and recent observations® support the existence of torsional oscil- 
lations with periods of 4 to 6 years. 

Longer-period waves are possible when the top of the fluid core is 
stably stratified’. The required ingredients for wave motion include 
fluid buoyancy, rapid rotation and a radial magnetic field that passes 
through the stratified layer. Figure 1 shows a schematic illustration of 
the associated fluid velocity. A radial component of flow V, alters the 
distribution of fluid density in the stratified layer and induces a per- 
turbation in pressure. Owing to the effects of rapid rotation, the pres- 
sure perturbation causes an azimuthal flow V, much like cyclonic flow 
around a pressure low in the atmosphere. Azimuthal flow is opposed 
by the presence of the radial magnetic field, which drives a weaker 
meridional flow Vg towards the pole. The restoring force for the waves 
comes partly from the buoyancy force, which opposes the radial motion, 
and partly from magnetic tension, which opposes the horizontal flow. 
The direction of flow reverses after halfa cycle; an eastward azimuthal 
flow drives a meridional flow towards the Equator. 

Braginsky’ obtained analytical solutions for these waves when the 
radial magnetic field is approximated by an axial dipole. Different modes 
of oscillations are distinguished by the spatial pattern of radial flow. 
The gravest modes are associated with upwelling at the poles and down- 
welling at the Equator over half a cycle. For each pattern of radial flow 


there are overtones associated with shorter wavelengths in the radial 
direction. In general the overtones are more heavily damped than the 
fundamental modes. In addition, waves with a larger number of upwel- 
lings and downwellings have shorter periods. 

A description of waves with a more general form of radial magnetic 
field requires a numerical solution. Indirect inferences from geodetic 
observations'*””, as well as numerical geodynamo models”, suggest that 
a large part of the radial magnetic field near the core-mantle boundary 
resides in wavelengths that are too small to detect at the Earth’s surface. 
A simple, but plausible, description of the radial magnetic field at the 
core-mantle boundary assumes a random distribution. Because mag- 
netic tension does not depend on the direction of the field, I can approx- 
imate the root-mean-square (r.m.s.) value of the radial magnetic field 
as a constant over the surface of the core. Changes in the radial magnetic 
field across the stratified layer are small when the layer is sufficiently 
thin. 

Two parameters specify the waves when the r.m.s. value of the radial 
magnetic field is fixed by geodetic constraints at 0.6 mT (see Methods). 
One parameter is the thickness H of the stratified layer and the other is 
the buoyancy frequency N defined as 


_,|_ 8% 
a p or 


Figure 1 | Schematic illustration of the wave motion. Radial motion V, 
causes a pressure perturbation, which drives an azimuthal flow Vy in the 
stratified layer. The presence of a radial magnetic field opposes Vy and induces 
a meridional flow Vy. The fluid velocities reverse direction over a full cycle of 
the wave. 
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where p = 10*kgm_ ° is the fluid density, @p/0r is the radial derivative 
andg= 10ms “isthe acceleration due to gravity. Variations in density 
due to changes in hydrostatic pressure do not contribute to the buoy- 
ancy force, so the radial derivative in density is defined relative to varia- 
tions in a well-mixed, isentropic fluid. For simplicity H is assumed to be 
constant and N varies linearly with radius across the layer. The value of 
Nis largest at the core-mantle boundary and decreases to zero at the 
base of the stratified layer. As a result, the waves are completely defined 
by H and the maximum value of N (denoted as Nmax). 

Figure 2 shows a representative wave with a symmetric pattern of 
upwelling and downwelling in the Northern and Southern hemispheres. 
This particular wave is computed using H = 140 kmand Ninax” = 1.02Q”, 
where 2 = 0.73 X 10° *s°' is the rotation rate of the planet. The cor- 
responding period of the wave is 58 years, which is within the uncer- 
tainty of estimates for geomagnetic fluctuations. 

The absolute amplitude of the wave in Fig. 2 is not uniquely set by 
the computations, but the relative size of the velocity components reveals 
the structure of the wave. The largest flow is associated with Vy», whereas 
V, is about an order of magnitude smaller. Estimates of Vy and Vg have 
been inferred at the top of the core from variations in the geomagnetic 
field*?, so in principle we can look for evidence of wave motion in the 
surface core flow. The zonal part of Vy (that is, independent of longit- 
ude) has a value of the order of several kilometres per year, and exhibits 
large variations in both space and time (see Fig. 3). On the other hand, 
the zonal part of Vg is often prohibited by dynamical constraints in the 
inversion for flow*”°. Consequently, I begin my search for waves using 
only the zonal part of Vy. 

Estimates of Vy over the time interval between 1930 and 1990 are well 
represented by a linear combination of waves. The chosen time interval 
is long enough to capture the dominant 60-year period, but restricts the 
record to times when the surface core flow is most reliable. Values for H 
and Nimax are iteratively adjusted to minimize the misfit between the 
core-flow estimates and a linear combination of six MAC waves (see 
Extended Data Table 1). For each pair of model parameters (H, Nmax)> 
the amplitude and phase of the waves are estimated by the method of 
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Figure 2 | A representative wave inside the stratified layer. The velocity 
components V, (a), Vy (b) and Vg (c) are shown as a function of radius r and 
latitude, where r = 1 denotes the core-mantle boundary. This particular wave 
has a period of 58 years for the best-fitting values of layer thickness H and 
buoyancy frequency Ninax- 
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Figure 3 | Estimate of surface core flow and prediction due to MAC waves. 
a, Zonal flow Vy from the uvm-s model of Jackson® is shown as a function of 
sin0 and time. Only the long-wavelength part of the flow in spherical harmonic 
degrees 1, 3 and 5 is retained for fitting the MAC waves. The components of the 
spherical harmonic expansion that are least affected are the small-scale flow 
and magnetic field at the core-mantle boundary”. b, A linear combination of 
six MAC waves can reliably reproduce the surface core flow. The model 
parameters for the waves (H, Nmax) are iteratively adjusted to find the optimal 
fit to the surface core flow. 


least squares. The optimal fit to the core flow is shown in Fig. 3 using 
H= 140km and Nyax” = 1.02Q*. Good agreement means that the 
surface core flow is compatible with waves, but I cannot conclude that 
the wave motion uniquely accounts for the core flow. 

An independent indication of wave motion is found in fluctuations 
of the dipole field’®. Once a linear combination of waves is chosen to fit 
Vy, the resulting Vp component of flow alters the strength of the dipole 
field by sweeping the radial magnetic field into the polar regions. To 
quantify the changes in the dipole, I assume that the effects of magnetic 
diffusion are small for decadal fluctuations”®. In this case time varia- 
tions in the radial magnetic field B, are described by 


B,= —Vu*(VB;) 


where Vy denotes the horizontal part of the divergence and V is the 
velocity due to waves. Only zonal components of B, at the longest wave- 
lengths contribute significantly to the dipole fluctuation because the 
waves are zonal and predominantly large-scale. When B, is evaluated 
at the mid-range epoch of 1960 using the magnetic field model gufm1 
(ref. 21), I obtain predictions for the fluctuation in the dipole field. 
Restoring the effects of magnetic diffusion causes only a small change 
in the amplitude and phase of the dipole fluctuation (see Methods). The 
resulting predictions are compared with time variations in the gufm1 
model”, after removing a linear trend in the strength of the dipole (Fig. 4). 
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Figure 4 | Fluctuations in the dipole field due to MAC waves. Predictions for 
the interval 1930-1990 (a) and 1960-2008 (b) are based on the flow models of 
Jackson® and Wardinski and Lesur”, respectively. The observed fluctuations 
are represented in terms of the Gauss coefficient g?(t) from the gufm1 model” 
and the IGRF model”, after removing linear trends. 


The amplitude and frequency of the dipole fluctuation agrees remarkably 
well with the prediction based on MAC waves. Even the decay of the 
dipole fluctuation is broadly consistent with the damping associated 
with the waves, although an excitation source must be present to sus- 
tain the waves over longer time intervals. 

Evidence for MAC waves can also be found in more recent models 
of surface core flow. One example is the extended CFM model”, which 
gives estimates for the flow between 1957 and 2008 from a combination 
of satellite and observatory observations. The set of waves that best fit 
the model of Jackson‘ also gives a good fit to the C7FM model, although 
it is not possible to capture fluctuations with periods shorter than the 
shortest-period wave (see Methods). Slightly better fits to the C7FM model 
are possible with a modest increase in stratification to Nmax = 1.08Q7, 
but I retain the original set of waves to predict dipole fluctuations in 
more recent times. These predictions match the observations reasonably 
well with no adjustable parameters (see Fig. 4b). 

Satisfactory predictions for the dipole fluctuations in Fig. 4 require 
an electrical conductivity of 10°Sm_ 1}, which is consistent with recent 
theoretical predictions'*"*. Lower values of electrical conductivity cause 
excessive damping of the waves. Adopting larger values for H can offset 
the excess damping, but thicker layers underestimate the amplitude of 
the dipole fluctuation. In addition, the model appears to require a fairly 
uniform r.m.s. radial magnetic field over the surface of the core. Calcu- 
lations with a dipolar radial field alter the structure of the waves enough 
to prevent a good fit to both Vy and the dipole fluctuation. 

Stratification at the top of the core can arise in several ways. Thermal 
stratification is expected when the core heat flow is less than a hypo- 
thetical heat flow conducted through a well-mixed region at the top of 
the core”*™* (sometimes called the adiabatic heat flow). A simple model 
for thermal stratification”’ suggests that a core heat flow of 13 TW would 


486 | NATURE | VOL 507 | 27 MARCH 2014 


produce a stratified layer 140 km thick when the adiabatic heat flow 
is 15 TW (ref. 11). These two heat flows correspond to temperature 
gradients that differ by 0.12Kkm ', yielding a density gradient of 
0.012kgkm * at the core-mantle boundary. The resulting buoyancy 
frequency is 1.5, which is close to the estimate recovered by fitting V, 
to MAC waves. Alternatively, the stratification might arise from radial 
variations in composition. Convection in the underlying region would 
probably provide a continuous source of excitation for the waves because 
buoyant parcels strike the base of the stratified layer. The resulting 
wave motion would obscure deeper motions in the core at periods of 
roughly a century and less. 


METHODS SUMMARY 


Numerical solutions for the waves are adapted from a numerical geodynamo model”. 
Perturbations in the velocity and magnetic fields are expanded in vector spherical 
harmonics using a truncation at degree / = 40. Both of these fields are defined on 
a radial grid with 200 equally spaced levels. Radial derivatives in the governing 
equations are represented by second-order finite differences. When the time depend- 
ence is periodic, the governing equations and boundary conditions (see Methods) 
yield an eigenvalue problem for the frequency and structure of the waves. Iterative 
solutions are obtained using an incomplete Arnoldi method”. The resulting waves 
are used to compute the associated fluctuations in the dipole field. The induction 
equation for the perturbation in the radial magnetic field at the core-mantle 
boundary is discretized in latitude and solved using the finite difference method. 
The dipole component is extracted using the orthogonality of spherical harmonics. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Methane fluxes show consistent temperature 
dependence across microbial to ecosystem scales 


Gabriel Yvon-Durocher', Andrew P. Allen, David Bastviken?, Ralf Conrad‘, Cristian Gudasz>’°+, Annick St-Pierre’, 


Nguyen Thanh-Duc® & Paul A. del Giorgio’ 


Methane (CH,) is an important greenhouse gas because it has 25 
times the global warming potential of carbon dioxide (CO2) by mass 
over a century’. Recent calculations suggest that atmospheric CH, 
emissions have been responsible for approximately 20% of Earth’s 
warming since pre-industrial times”. Understanding how CH, emis- 
sions from ecosystems will respond to expected increases in global 
temperature is therefore fundamental to predicting whether the 
carbon cycle will mitigate or accelerate climate change. Methano- 
genesis is the terminal step in the remineralization of organic matter 
and is carried out by strictly anaerobic Archaea’. Like most other 
forms of metabolism, methanogenesis is temperature-dependent*”. 
However, it is not yet known how this physiological response com- 
bines with other biotic processes (for example, methanotrophy’*, sub- 
strate supply*’, microbial community composition®) and abiotic 
processes (for example, water-table depth”°) to determine the tem- 
perature dependence of ecosystem-level CH, emissions. It is also 
not known whether CH, emissions at the ecosystem level have a 
fundamentally different temperature dependence than other key 
fluxes in the carbon cycle, such as photosynthesis and respiration. 
Here we use meta-analyses to show that seasonal variations in CH, 
emissions from a wide range of ecosystems exhibit an average tem- 
perature dependence similar to that of CH, production derived 
from pure cultures of methanogens and anaerobic microbial com- 
munities. This average temperature dependence (0.96 electron volts 
(eV)), which corresponds to a 57-fold increase between 0 and 30°C, 
is considerably higher than previously observed for respiration (approx- 
imately 0.65 eV)"' and photosynthesis (approximately 0.3 eV)”. As 
a result, we show that both the emission of CH, and the ratio of CH, 
to CO, emissions increase markedly with seasonal increases in 
temperature. Our findings suggest that global warming may have 
a large impact on the relative contributions of CO, and CH, to total 
greenhouse gas emissions from aquatic ecosystems, terrestrial wet- 
lands and rice paddies. 

Biogenic CH, fluxes are a major component of global CH, emissions, 
yet they are poorly constrained*'*"*. There are large uncertainties not 
only in the current magnitude of these fluxes but also in the factors that 
regulate them””’. In particular, there is substantial uncertainty in the 
parameterization of the temperature dependence of natural CH, emis- 
sions in process-based biogeochemistry models'*""*, which greatly hin- 
ders our ability to predict the response of this key component of the 
carbon cycle to global warming. For example, temperature dependencies 
for ecosystem-level CH, emissions have reported apparent activation 
energies that vary from 0.2 to 2.5 eV (refs 6, 19-21) (leV = 96kJ mol’). 

Ina bid to reduce this uncertainty, which is fundamental to improv- 
ing projections of feedbacks between the carbon cycle and future climate 
change’*-'*, we quantified variation in the temperature dependence of 
CH, fluxes in meta-analyses of three different types of experiments (pure 


cultures of methanogens, laboratory incubations of anaerobic sediments, 
and seasonal field surveys of CH, emissions) that correspond to three 
distinct levels of biological organization (population, community and 
ecosystem). In particular, we assess whether ecosystem-level CH, emis- 
sions exhibit temperature dependencies similar to those of the underly- 
ing methanogenic process. To do this, we first established the magnitude 
and variability of the temperature dependencies of key rate processes 
for populations of methanogens in culture (methanogenesis, growth) 
and laboratory incubations of anaerobic microbial communities from 
natural sediment samples (CH, production). We then assessed whether 
these temperature dependencies differ from those observed in an 
ecosystem-level analysis of the seasonal temperature dependence of 
CH, emissions from aquatic, wetland and rice-paddy ecosystems. 

To characterize the temperature dependencies of physiological rate 
processes for methanogens, we fit the Boltzmann—Arrhenius function 
(which describes the exponential relationship between metabolic rate 
and temperature, assuming a single enzyme-catalysed reaction is rate- 
limiting”), separately, to the data compiled from the population and 
community-level experiments using linear mixed-effects models (see 
Methods). 

The population-level analysis reveals that the average temperature 
dependencies for the rates of methanogenesis and growth are similar. 
Specifically, the improvement in model fit going from a null model, 
which assumes a common average activation (Ey in equation (1), see 
Methods) energy for both rate processes to an alternative model, which 
assumes a distinct average activation energy for each rate process, is not 
statistically significant (likelihood ratio test: ~ = 0.39, d.f. = 1, P= 0.53). 
Thus, the average temperature dependencies for both rate processes 
(methanogenesis and growth) can be characterized using the same aver- 
age apparent activation energy (Ey = 1.10 eV, 95% confidence interval 
of 0.93-1.27 eV; Fig. 1a). 

The community-level analysis of CH, production rates from anaer- 
obic sediment incubations produces a similar value for the average acti- 
vation energy (Ey = 0.93 eV, 95% confidence interval of 0.82 - 1.03 eV), 
suggesting that the temperature dependence of CH, production at the 
community level largely reflects the kinetics of the physiological pro- 
cesses generating this flux. More detailed analyses of these community- 
level data indicate that the average activation energies for the sediments 
from three broadly defined ecosystem types (that is, aquatic ecosystems, 
wetlands and rice paddies) are statistically indistinguishable (likelihood 
ratio test comparing a null model with a single activation energy to an 
alternative model with separate activation energies for each ecosystem 
type: ¢ = 1.62, d.f. = 2, P= 0.44). However, for both the population- 
and community-level analyses, temperature dependencies do vary among 
experimental units, i, as indicated by the statistical significance of random 
effects on the activation energies (characterized by éz,,; in equation (1), 
see Methods and Table 1), owing, for example, to inherent physiological 
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Figure 1 | Temperature dependence of CH, production and related 
processes at population and community levels. Temperature dependencies 
for methanogen populations in culture (a) and anaerobic microbial 
communities from natural sediment samples (b) are separately characterized 
using mixed-effects models by fitting Boltzmann-Arrhenius functions with 
experimental-unit-level random effects on the apparent activation energy and 
rate at fixed temperature (see equation (1) in the Methods). Analyses reveal 
that average temperature dependencies are statistically indistinguishable for 
rates of growth and methanogenesis (a, open circles and filled circles, 
respectively) by populations of methanogens (P = 0.53), and rates of CH, 
production by laboratory-incubated anaerobic microbial communities from 


differences between strains of methanogens in culture and variation 
between sediment samples in microbial community composition. 

To assess how the average temperature dependence of CH, pro- 
duction for methanogenic populations and communities compares to 
that of CH, emissions from entire ecosystems, we fit the Boltzmann- 
Arrhenius equation to a database of 1,553 measurements of CH, emis- 
sion and temperature, measured seasonally for 127 field sites that span 
the globe and encompass wetlands, rice paddies and aquatic ecosystems. 
Analysis revealed that the estimated average activation energies are stat- 
istically indistinguishable for these three ecosystem types (likelihood 
ratio test comparing a null model with a single activation energy to an 
alternative model with separate activation energies for each ecosystem 
type: va = 4.97, d.f. = 2, P= 0.10), and can therefore be characterized 
by a common parameter (Ey, = 0.96 eV; 95% confidence interval of 
0.86-1.07; Fig. 2). This average temperature dependence is statistically 
indistinguishable to those observed for CH, production from popu- 
lations in culture and incubations of microbial communities from 
sediments—that is, all three estimates for Ey, have 95% confidence 
intervals that overlap (Table 1)—which is remarkable given the mul- 
titude of processes that may confound the temperature dependence of 
CH, emissions over a seasonal cycle at the ecosystem level. Thus, our 
analysis is consistent with the hypothesis that the seasonal temperature 


aquatic, wetland and rice-paddy sediments (b, squares, triangles and crosses, 
respectively) (P = 0.44). The fitted solid lines correspond to the average 
apparent activation energies estimated from the mixed-effects models 

(Em = 1.10 and 0.93 eV for the population (a) and community (b) analyses, 
respectively). Data have been standardized by subtracting from each 
measurement the estimated experiment-specific intercept; that is, the 
estimated rate at fixed temperature, In F(Tc), where Tc is the average of the 
temperature measurements for the population-level data (27.2 °C in a) and the 
community-level data (17.5 °C in b). These standardizations are used for 
visualization of the data only. 


dependence of CH, emissions at the ecosystem level largely reflects the 
kinetics of the methanogenic processes generating this flux (see Sup- 
plementary Information section 2 for further discussion of the poten- 
tial mechanisms governing the scaling of the temperature dependence 
of CH, fluxes from populations to ecosystems). 

Importantly, the average apparent activation energy we report here 
for the seasonal temperature dependence of ecosystem-level CH, emis- 
sions (Ey = 0.96 eV) is considerably higher than that reported previously 
for CO; fluxes attributable to respiration (~0.65 eV)"' and photosyn- 
thesis (~0.3 eV)”, which could have important implications for the 
effect of global warming on the balance of CH, and CO, emissions 
from ecosystems”. Given these differential temperature dependencies, 
we expected the ratio of CH, to CO2 emissions to increase, on average, 
with seasonal increases in temperature for a collection of sites (see Sup- 
plementary Information section 1). 

To test this prediction, we analysed ecosystem-level data for the sub- 
set of sites in our compilation where simultaneous measurements of 
CH, and CO, fluxes have been made, enabling us to calculate the efflux 
ratio of these greenhouse gases in response to seasonal variation in 
temperature. This data set comprises 177 estimates from 38 field sites. 
In exactly the same way as for the analyses of CH, emissions, we fit a 
linear mixed-effects model using the Boltzmann-Arrhenius function 


Table 1 | Estimates of the parameters used to characterize the temperature dependencies of CH, production, emission and related variables 


Rate or ratio Average activation energy 


(eV) (95% Cl) 


Standard deviation of random 
effect on rate or ratio at fixed 


Standard deviation of random 
effect on activation energy 


temperature 
Population-level rate of methanogenesis or growth 1.10 (0.93-1.27) 0.42 2.28 
Community-level rate of CH, production 0.93 (0.82-1.03) 0.32 2.45 
Ecosystem-level CH, emission 0.96 (0.86-1.07) 0.42 217 
Ecosystem-level CH4:CO2 emission ratio 0.71 (0.46-0.97) 0.59 1.03 


Parameters were determined by fitting Boltzmann-Arrhenius functions to rates (equation (1)) or ratios of rates (equation (2)) using mixed-effects models. The standard deviations of the random effects used to 
characterize differences among sites or experimental units in temperature dependencies (ég,; and ég,,,.; in equations (1) and (2) of the Methods, respectively) are significantly greater than zero in all analyses 
(P <0.05), as are the standard deviations of the random effects used to characterize differences in rates (or rate ratios) among sites or experimental units at fixed temperature (¢; and eg, respectively) (P< 0.001). 
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Figure 2 | Temperature dependence of CH, emissions at the ecosystem 
level. Temperature dependencies in a are characterized using a mixed-effects 
model by fitting a Boltzmann-Arrhenius function to the emission data with 
site-level random effects on the apparent activation energy and rate at fixed 
temperature (see equation (1) in the Methods). Analysis reveals that CH, 
emissions exhibit average temperature dependencies that are statistically 
indistinguishable for aquatic (squares), wetland (triangles) and rice-paddy 
(crosses) ecosystems (P = 0.10). The fitted solid line corresponds to the 
average apparent activation energy estimated from the mixed-effects model 


to the natural logarithm of the CH4:CO) flux data. As predicted, this 
ratio increases with increasing temperature for the majority of the field 
sites (35 of 38), yielding average temperature dependence across sites 
(characterized by Ejyy.c in equation (2), see Methods) of 0.71 eV (95% 
confidence interval of 0.46-0.97 eV; Fig. 3). This finding suggests that, 
on average, the relative contribution of CH, to total greenhouse gas 
emissions increases with seasonal increases in temperature due to the 
differences in the biochemical kinetics of methanogenesis, respiration 
and photosynthesis. 

Our analyses demonstrate that, on average, CH, emissions for a wide 
range of ecosystems show a temperature dependence similar to that of 
methanogenesis by methanogen populations in culture and CH, pro- 
duction from incubations of microbial communities from sediments. 
Moreover, this average temperature dependence is much higher than 
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(Em = 0.96 eV). The distribution of apparent activation energies among sites 
(b) yields an average (Ey = 0.96 eV, represented by a dashed line) that is 
statistically indistinguishable (P > 0.05) from the averages derived from the 
population- and community-level analyses (Fig. 1 and Table 1). In a, data 
have been standardized by subtracting from each measurement the estimated 
site-specific intercept; that is, the estimated rate of CH, emission at fixed 
temperature, In F(Tc), where Tc is the average of the temperature 
measurements in the field emissions data set (15.6 °C). This standardization is 
used for visualization of the data only. 


that of both respiration'' and photosynthesis”, resulting in a greater 
relative contribution of CH, to total carbon emissions at higher tem- 
peratures. Although this consistent temperature dependence of CH, 
fluxes from microbes to ecosystems is remarkable, our results also empha- 
size that temperature is not the only variable that controls CH, emis- 
sions from ecosystems. The substantial site-to-site variation we report 
for the temperature dependence of ecosystem-level CH, emissions (see 
Fig. 2b and Table 1) highlights the importance of other variables in driv- 
ing deviations from the underlying physiological response. Moreover, 
the average apparent activation energy we report here (Ey, = 0.96 eV), 
based on the seasonal temperature dependence of CH, emissions within 
sites, does not apply across sites along geographic temperature gradients. 
Indeed, the response of CH, emissions to geographic variation in tem- 
perature (Extended Data Figs 1 and 2) is consistently weaker than the 


Figure 3 | Temperature dependence of the 
CH,4:CO, emission ratio. Temperature 


dependencies are characterized in a using a mixed- 
effects model by fitting a Boltzmann-Arrhenius 
function with site-level random effects on the 
temperature dependence and ratio at fixed 
temperature (see equation (2) in the Methods). The 
fitted solid line in a corresponds to the average 
temperature dependence estimated from the 
mixed-effects model (Ey.c = 0.71 eV). The 
distribution of site-level temperature dependencies 
(presented as a histogram of slope estimates in 

b) yields an average (Eyy.c = 0.71 eV, represented 
by a dashed line in b) that is positive and significant 
(P> 0.05). Data in a have been standardized by 
subtracting from each measurement the estimated 
site-specific intercept; that is, the estimated ratio of 
CH4:CO} emissions at fixed temperature, In F(Tc), 
where Tc is the average temperature for the ratio 


data (15.3 °C). This standardization is used for 
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response to seasonal variation within sites (see Fig. 2). Furthermore, for 
natural wetlands, which span the broadest gradient in average temper- 
ature, we observe a significant negative correlation between the site-level 
intercept—that is, CH, emissions at fixed temperature, In F(T¢) (see 
equation (1) in Methods)—and average site temperature (Extended 
Data Fig. 2e). These results suggest that temperature-dependent vari- 
ation in other biotic (for example, methanotrophy*, substrate supply*”, 
microbial community structure and/or composition®, physiological accli- 
mation and/or adaptation”) and abiotic variables (for example, water- 
table depth”'®) may have an important role in constraining the response 
of CH, emissions to warming in the long-term, as has been suggested 
for rates of ecosystem respiration’. 

Overall, our findings provide an empirical basis for refining repre- 
sentations of the temperature dependence of CH, fluxes in coupled 
climate-carbon cycle models'*"'*”*. In particular, they provide evidence 
that the temperature dependencies of methanogenesis, CH, produc- 
tion and CH, emissions are substantially higher than two other key rate 
processes in the carbon cycle, heterotrophic respiration and photosyn- 
thesis. The observation of a general increase in the CH4:CO} ratio with 
increasing temperature, driven by the relatively high temperature depen- 
dence of CH, production, may have important implications for the 
magnitude of future positive feedbacks between global warming and 
the carbon cycle given the greater potency of CH, compared to CO, as 
a greenhouse gas*”’. 


METHODS SUMMARY 


Weused linear mixed-effects modelling” to quantify the temperature dependencies 
of metabolic rate processes for pure cultures of methanogens, anaerobic microbial 
communities and seasonal field surveys of CH, emissions by fitting Boltzmann- 
Arrhenius functions of the form: 


InF(T)= (Eww +6843) (gre — gp) + (INFCTe) +) (1) 


where In F;(T) is the natural logarithm of the rate of CH, flux at absolute temperature, 
T (K), for some arbitrary experimental unit, i. In this expression, each experimental 
unit, i, corresponds to a distinct strain of methanogen (population-level analysis), 
sediment sample (community-level analysis) or site (ecosystem-level analysis). The 
parameter Ey (in eV) corresponds to an average among experimental units for the 
apparent activation energy, which characterizes the temperature dependence of 
F,(T), and k is the Boltzmann constant (8.62 X 10° eV K '). We centred the 
temperature data separately in each analysis using the mean temperature for the 
data set, Tc, so that In F(T) corresponds to an average among experimental units 
for the rate at Tc. For each analysis, we expect estimates of the apparent activation 
energy, Ey, and the rate of CH, production at a fixed temperature, In F(Tc), to 
vary between experimental units. We account for this variation by treating the slopes 


and intercepts as random variables with averages of Ey, and In F(Tc), respectively, 
and deviations from these averages of éz,,,; and ¢f,; for each experimental unit, i. 

We used linear mixed-effects modeling”* to quantify the temperature depend- 
ence of the CH4:CO; emission ratio using a Boltzmann—Arrhenius-type approxi- 
mation (see Supplementary Information section 1) of the form: 


In Qi(T) = (Enc + Fae) (z 7 a) +(imQTe)+£0:) 2) 


where Q;(T) is the ratio of CH, to CO, emissions for site i at temperature T, Ev.c 
and In Q(Tc) are the respective averages across sites for the temperature depen- 
dence of this ratio and the magnitude of the ratio at a fixed temperature T¢;3 &8y,.c,1 
and €q,; are random-effects terms used to represent site-level deviations from these 
averages. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Statistical analyses. We used linear mixed-effects modelling’ to quantify the 
temperature dependencies of metabolic rate processes for pure cultures of metha- 
nogens, anaerobic microbial communities, and seasonal field surveys of CH, emis- 
sions, by fitting Boltzmann-Arrhenius functions of the form: 


iaF(T)= (Ey tas) = = a) + (InF(Tc) +¢e,) (1) 


where In F;(T) is the natural logarithm of the rate of CH, flux at absolute tem- 
perature, T (K), for some arbitrary experimental unit, i. In this expression, each 
experimental unit, i, corresponds to a distinct strain of methanogen (population- 
level analysis), sediment sample (community-level analysis) or site (ecosystem-level 
analysis). The parameter Ey, (in eV) corresponds to an average among experimental 
units for the apparent activation energy, which characterizes the temperature depen- 
dence of F;(T), and k is the Boltzmann constant (8.62 X 10° eV K_'). We centred 
the temperature data separately in each analysis using the mean temperature for the 
data set, Tc, so that In F(Tc) corresponds to an average among experimental units 
for the rate at Tc. For each analysis, we expect estimates of the apparent activation 
energy, Ey, and the rate of CH, production at a fixed temperature, In F(Tc), to 
vary between experimental units, which we account for by treating the slopes and 
intercepts as random variables with averages of Ey and In F( Tc), respectively, and 
deviations from these averages of ég,,; and ¢p; for each experimental unit, i. 

We used linear mixed-effects modeling” to quantify the temperature depen- 
dence of the CH4:CO, emission ratio using a Boltzmann-Arrhenius-type approxi- 
mation (see Supplementary Information section 1) of the form: 


(Emic + €8y.¢.i) (= a) (in Q(T) +20) (2) 


where Q;(T) is the ratio of CH, to CO, emissions for site i at temperature T, Exy-c 


In Q(T) 


and In Q (Tc) are the respective averages across sites for the temperature depend- 
ence of this ratio and the magnitude of the ratio at a fixed temperature Tc; ég,,,.. 
and ég; are random-effects terms used to represent site-level deviations from these 
averages. 

Mixed-effects models are the most appropriate tool for meta-analyses of the 
kinds of data compiled for this study because they allow for nested covariance 
structures, where site- or experimental-unit-level relationships are nested within 
overall relationships, and can accommodate unbalanced designs, where the num- 
ber of measurements vary among experimental units**’’. This approach enabled 
us to determine the overall average temperature dependence of flux (Ey, repre- 
sented by a ‘fixed’ effect) for a collection of experiments or sites, i, while accounting 
for the fact that multiple flux-temperature relationships (corresponding to ‘ran- 
dony effects on the slope (ég,,,;) and intercept (gp) for each i) are nested within 
this overall relationship. It also allowed us to explicitly quantify the variation in 
temperature dependencies among sites (or experimental units). 

The appropriateness of this technique is perhaps most easily explained for the 
analysis of field data in this study. Given that there are many other factors, besides 
temperature, that influence CH, emissions, we expect the estimated apparent 
activation energy to exhibit variation among sites. In particular, we expect the 
apparent activation energy to differ from that of the physiological flux whenever 
other variables that affect the rate process exhibit covariation with temperature 
(see ref. 11 for a theoretical derivation of this result). In this analysis, we are 
interested in two types of quantities: first, the fixed effects representing the average 
temperature response of CH, emissions across all 127 sites, Ey, and the average 
emission rate at fixed temperature, In F(T;); and second, the random effects 
(€g,,,i€F,1) representing the magnitudes of the deviations from these fixed effects 
for each of the 127 sites. The deviation from the fixed effects for any given site will 
be driven by other variables, besides temperature, that affect CH, emission and 
seasonally covary with temperature in the field; for example, methanotrophy’, water- 
table depth”””, substrate supply’, community composition®. Similar reasoning can be 
used to justify applying the mixed-effects modelling approach to the population- and 
community-level data. Although this statistical approach does not allow us to iden- 
tify the particular variables that contribute to differences in flux—temperature rela- 
tionships among sites or experimental units, it does allow us to quantify the overall 
magnitude of their effects as standard deviations of the random-effects terms. Given 
that the magnitudes of these deviations are assumed to represent a sample from a 
multivariate normal distribution, the resulting model can be generalized to a larger 
population of sites. 

Parameters were estimated by fitting the Boltzmann-Arrhenius equation to the 
data using mixed-effects models via the ‘mer’ function in the ‘Ime4’ package of R 
statistical software (v 3.0.2)”*. For these analyses, we adopted a top-down approach, 
starting with the most complex model, to determine the significance of the fixed and 
random effects in a two-stage analysis”®. In stage one, we determined whether it was 


necessary to include random effects corresponding to variation in both the slope 
and intercept among units, i, by using a likelihood ratio test”®”’ to assess the improve- 
ment in model fit going from a null model, which includes all potential fixed effects 
and only one random effect (corresponding to intercept variation), to an alterna- 
tive model, which includes all potential fixed effects and both random effects (that 
is, on the slope and intercept). Analysis revealed that the random-effects structure 
that best described each data set included random variation in both the slope and 
the intercept (see Table 1 for details of the statistics). In the second stage, we applied 
the random-effects structure determined in stage one to assess the significance of 
the fixed effects (averages across sites for the apparent activation energy and inter- 
cept), and other potential covariates, using likelihood ratio tests. The results of 
these likelihood ratio tests are given below and in the main text. 

Data compilations. For population-level experiments, measurements of rates of 
methanogenesis and temperature from cultures of methanogenic Archaea were 
compiled based on an exhaustive search of the literature. Experiments were only 
included in the analysis if they met the following criteria: rates were measured at 
three or more distinct temperatures at or below the optimum; rates were measured 
during the logarithmic (that is, exponential) phase of population growth, either in 
batch or continuous culture; rates could be normalized by population biomass. 
These criteria were met for 12 separate strains. Only data up to the optimum tem- 
perature were analysed. Two experiments of enrichment cultures for acetic acid 
conversion to CH, were also included because these culturing techniques result in 
numerical dominance by methanogenic populations”. In our analysis, these enrich- 
ment cultures had estimates of the apparent activation energy within the range exhib- 
ited by the pure cultures. 

To explore further the temperature dependence of metabolic rate processes in 
methanogens, we compiled additional data on methanogen growth rates, which 
have been reported more frequently in the literature than rates of methanogenesis. 
Given the rapid growth rates of these populations (that is, doubling times of less 
than 1 day), and that the energy transformations required to fuel this growth result 
from methanogenesis, growth rates of methanogens are likely to be closely linked 
to rates of methanogenesis and hence ecosystem-level CH, emissions. Growth rate 
data meeting the same three criteria listed above for methanogenesis were compiled 
for 21 strains of methanogens, and encompass both the acetoclastic and hydrogen- 
trophic groups. 

Mixed-effects model analysis indicates that the average activation energies are 
not significantly different for rates of methanogenesis and growth (likelihood ratio 
test comparing a null model with a single activation energy to an alternative model 
with separate activation energies for methanogenesis and growth rate; 7 = 0.39, 
d.f. = 1, P= 0.53). These findings suggest that methanogen growth efficiency is 
approximately independent of temperature for populations that are growing expo- 
nentially and are not therefore substrate limited. We also found no significant 
difference between the acetoclastic or hydrogenotrophic methanogens in the tem- 
perature dependencies of either growth rate or methanogenesis (likelihood ratio 
test comparing a null model with a single activation energy to an alternative model 
with separate activation energies for acetoclastic or hydrogenotrophic methano- 
gens; a = 1.30, d.f. = 1, P= 0.25). We therefore combined the data on rates of 
growth and methanogenesis for both groups to characterize an overall average 
temperature dependence for these metabolic rate processes in methanogen popu- 
lations (Fig. 1a). 

For community-level experiments, rates of CH, production were compiled 
from the literature from experiments in which sediment samples from aquatic, 
wetland and rice-paddy ecosystems, where incubated anaerobically under labor- 
atory conditions at multiple temperatures. Our data compilation consists of 402 
estimates of CH, production from 47 separate anaerobic communities sampled 
from 16 aquatic ecosystems, 26 wetlands and 5 rice-paddy ecosystems. Unlike the 
culture data, these data characterize the overall effect of temperature on the pro- 
duction of CH, by the entire sediment microbial community*’. 

CH, is produced as the terminal product of organic matter decomposition, and 
is known to be dependent on a series of syntrophic associations with anaerobic 
bacteria, protozoa and fungi, which supply the methanogens with the precursors 
for methanogenesis’, H2, CO, and acetate. It is accomplished by the more or less 
sequential reduction of NO; , Mn(IV), Fe(III), and SO,°-, before CH, produc- 
tion becomes the sole process of organic matter degradation*'*’. The length of this 
reduction period, during which organic matter is oxidised to CO2, and which 
precedes the production of CHy, is determined by the relative amounts of organic 
matter and inorganic electron acceptors in the sediment*'. These factors are likely 
to vary between experiments carried out on sediments from different sites. Thus, 
between-experiment variation in the length of the incubation period over which 
the estimate of CH, production is determined may influence the measured rate, 
and may also interact with temperature (for example, if the sequential reduction of 
electron acceptors proceeds faster at higher temperatures”) to influence the esti- 
mated apparent activation energy. 
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The experimental techniques applied in these studies were heterogeneous with 
respect to the duration over which the flux of CH, was determined—ranging from 
2 to 30 days—and with respect to whether or not there was a pre-incubation 
period. We found no evidence that length of the incubation duration affected 
estimates of the apparent activation energy when this variable was included as a 
fixed factor in our linear mixed-effects model (likelihood ratio test comparing 
an alternative model including an interaction between temperature and the length 
of the incubation duration to a null model with no interaction term: va = 0.95, 
d.f. = 1, P= 0.33). These findings suggest that the estimated apparent activation 
energy is largely unaffected by potential effects of temperature on the speed of 
alternative electron acceptor reduction. Among the 47 studies, 15 entailed pre- 
incubation periods ranging from 1 to 60 days in the dark at 4°C. We found no 
evidence that the presence or absence of an incubation period affected the esti- 
mated temperature dependence when this variable was included as a binary fixed 
factor in our mixed-effects model (likelihood ratio test comparing a null model 
with a single activation energy to an alternative model with separate activation 
energies for experiments with and without pre-incubation periods; 7° = 1.92, 
d.f. = 1, P = 0.38). In summary, estimates of the apparent activation energy appear 
to be largely unaffected by differences in experimental protocols among studies. 

Our database of ecosystem-level CH, emission data comprises both published 
data compiled from the literature and unpublished data analysed here for the first 
time (see below for data-collection methods for the unpublished data set). In total, 
this database comprises 1,553 estimates of CH, emission and temperature taken 
from 127 sites across the globe, including 57 aquatic ecosystems, 51 wetlands and 
19 rice paddies (Extended Data Fig. 1). CH, emissions were estimated using one of 
three methods: the floating-chamber technique; the eddy-covariance method; or 
occasionally (<5% of the estimates) the modelled-diffusion method, which yields 
estimates based on the concentration of CH, just below the air—water interface. 
The estimated apparent activation energies of seasonal CH, emissions did not vary 
significantly among the three methods when this variable was included as a fixed 
factor in the mixed-effects model (likelihood ratio test comparing a null model 
with a single activation energy to an alternative model with separate activation 
energies for each measurement method; ¢ = 0.52, df. = 2, P = 0.77), which jus- 
tifies combining data obtained using all three methods for the analysis. 

We assessed potential differences in the temperature dependence of CH, emis- 
sions among three ‘ecosystem types’-—aquatic, wetland and rice-paddy ecosys- 
tems—owing to differences in the physical and chemical characteristics of these 
ecosystems. ‘Aquatic’ ecosystems were defined as those that have standing water 
throughout the year, and included lakes and rivers. Wetlands were defined as natural 
systems with either permanently or seasonally water-saturated soils, and include 
peatlands, bogs and swamps. Finally, rice paddies were defined as man-made 
water-saturated sites established for rice cultivation. We separated these systems 
for the analysis, because the differences in their physical and chemical character- 
istics have the potential to modify the seasonal temperature dependence of CH4 
emission in a systematic way. For example, methanotrophy in aerobic zones can 
re-oxidize up to 90% of the CH, produced in anaerobic sediments®. Among these 
three ecosystem types, aquatic systems (lakes and rivers) are expected to have the 
largest aerobic zones beneath the air—water interface, and may therefore be 
expected to exhibit temperature dependencies that deviate furthest from the 
intrinsic temperature dependence of methanogenesis. However, as noted in the 
main text, although the temperature dependencies clearly varied between sites 
(Fig. 2b), we found no evidence of systematic differences in temperature depend- 
ence between the three ecosystem types (likelihood ratio test comparing a null 
model with a single activation energy to an alternative model with separate activa- 
tion energies for each ecosystem type; ¢ = 4.97, d.f. = 2, P = 0.10). This finding is 
broadly consistent with recent studies***°, which show that methanotrophy is 
primarily limited by substrate availability under in situ CH, concentrations in 
aquatic ecosystems, and exhibits no temperature dependence when CHs is lim- 
iting. Therefore, our results, and those of refs 33-35, suggest that methanotrophy 
generally does not interact with CH, production and temperature under in situ 
CH, concentrations. Rather, methanotrophy seems to track the seasonal dyna- 
mics of CH, production, resulting in little modulation of the observed tempera- 
ture dependence. 

Another factor that may systematically affect the observed temperature depen- 
dence of CH, emissions, at least for the fluxes estimated through the floating-chamber 
technique, is the relative contributions of fluxes attributable to diffusion versus 
ebullition. This is because ebullition has the potential to bypass the methano- 
trophic microbial populations in the oxic zones of the sediment and water column. 
We believe that the estimates of CH, emissions in our database are largely attrib- 
utable to diffusive fluxes, due to the extensive sampling effort (in both space and 
time) required to accurately quantify ebullition fluxes. Although none of our flux 
estimates could be partitioned into diffusive and ebullition fluxes, we attempted to 
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assess indirectly whether ebullition influenced the estimated temperature depend- 
ence by including the duration over which the flux chamber was deployed as a 
covariate in our mixed-effects model. In carrying out this analysis, we reasoned 
that ebullition fluxes were more likely to be assayed for chambers deployed for 
longer periods of time. However, we found no evidence ofa significant interaction 
between ‘flux duration’ and ‘temperature’ in the mixed-effects model (likelihood 
ratio test comparing an alternative model including an interaction between tem- 
perature and flux duration to a null model with no interaction term: ¢ = 0.85, 
d.f. = 1, P = 0.36) when analysing fluxes estimated using the floating-chamber 
technique (n = 1,007, 118 sites). 

The unpublished emission data included in our analysis come from lakes and 
rivers located in four distinct boreal regions of Québec, eastern Canada: Abitibi 
south (48.5° N, 79° W), Abitibi Borth (51° N, 79° W), Chibougamau (49.5° N, 74° 
W) and Chicoutimi (48° N, 71° W), and in boreal Sweden (58-59° N, 12-18° E). 
Geographic and climatic details of these regions can be found in ref. 36 and at 
http://www.smhi.se, respectively. The selected lakes and rivers were visited from 
three to 12 times during the open water season (May to November 2012). Surface- 
water partial pressures of CO (pCO; Canadian sites only) and of CH, (pCH4) 
were measured using the headspace method. Water was sampled at 0.1-0.5 m from 
the surface from the deepest measured point of lakes, and near the shores of streams, 
rivers and wetlands. Samples were taken using a 60-ml polypropylene syringe by 
extracting 30 ml of water, followed by 30 ml of ambient air, to create a headspace in 
a 1:1 ratio of ambient air to water. The syringe was then vigorously shaken for 
1 min in order to equilibrate the gases in water and air. Ambient pCO. was mea- 
sured in situ using an EGM-4 infrared gas analyser (PP-systems). pCH,4 samples 
(in duplicate) were placed in 30-ml glass vials equipped with crimped rubber 
stoppers, filled with saturated saline solution, and then stored inverted until sub- 
sequent analysis in laboratory with Shimadzu GC-8A Gas chromatograph with 
FID (flame ionization detector). In Canada, the CO and CH, fluxes were measured 
on the basis of changes in pCO, and pCH, with time in floating chambers. The 
chamber was connected in closed loop to an EGM-4 infrared gas analyser (PP- 
systems), with pCO, in the chamber measured every minute for 10 min. The accu- 
mulation rate was estimated from these data using linear regression. Samples for 
pCHy, were taken from a sampling port every 10 min, and analysed as described 
above. Flux calculations were carried out following ref. 37. The measurements in 
Sweden focused on CH, only allowing 24 h floating flux chamber deployments. 
The details of field approaches and flux calculations for these measurements are 
found in ref. 38. 

For ecosystem-level studies including CH,4:CO, ratio, the seasonal temperature 
dependence of the CH4:CO, efflux ratio was estimated for the subset of field sites 
in our database (38 of 127) with simultaneous measurements of CO, and CH4 
emissions. 
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Extended Data Figure 1 | Global distribution of the field sites included in _to the logarithm (base 10) of the mean rate of CH, emission (mg CH, per m? 
our analysis of ecosystem-level CH, emissions. The size of each point relates _ per day) over the duration of the experiment. 
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Extended Data Figure 2 | Correlations of average site temperatures with 
average CH, emissions and CH, emissions at fixed temperature for globally 
distributed ecosystems. Average CH, emissions are shown in a-c, and CH, 
emissions at fixed temperature are shown in d-f. Average site temperature is 
positively correlated with average CH, emissions, In F(T), for aquatic 
ecosystems (a) and natural wetlands (b), although temperature explains only 
12% and 9% of the variance, respectively, for CH, emissions from these two 
ecosystem types. In contrast, it is not significantly correlated with average CH, 
emissions for rice paddies (c). Average site temperature is also not correlated 


with CH, emissions at fixed temperature, In F(Tc) (where Tc is the average 
temperature across the field emissions data set (15.6 °C)), for aquatic 
ecosystems (d) and rice paddies (f), but is significantly negatively correlated for 
natural wetlands (e). The latter finding suggests that temperature-dependent 
biotic (for example, methanotrophy, substrate supply, microbial community 
structure, physiological acclimation and/or adaptation) and abiotic factors 
(for example, water-table depth) may play an important role in constraining 
variation in total CH, emissions among wetlands along geographic 
temperature gradients. 
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Geographical limits to species-range shifts are 
suggested by climate velocity 


Michael T. Burrows', David S. Schoeman?, Anthony J. Richardson*”, Jorge Garcia Molinos’, Ary Hoffmann®, Lauren B. Buckley’®, 
PippaJ. Moore”’®, Christopher J. Brown”, John F. Bruno®, Carlos M. Duarte!?4!?, Benjamin S. Halpern’*""*, Ove Hoegh-Guldberg’, 
Carrie V. Kappel'*, Wolfgang Kiessling’>!°, Mary I. O’Connor”’, John M. Pandolfi!®, Camille Parmesan!””°, William J. Sydeman”., 


Simon Ferrier”, Kristen J. Williams” & Elvira S. Poloczanska* 


The reorganization of patterns of species diversity driven by anthro- 
pogenic climate change, and the consequences for humans’, are not 
yet fully understood or appreciated’. Nevertheless, changes in cli- 
mate conditions are useful for predicting shifts in species distribu- 
tions at global* and local scales*. Here we use the velocity of climate 
change®’ to derive spatial trajectories for climatic niches from 1960 
to 2009 (ref. 7) and from 2006 to 2100, and use the properties of 
these trajectories to infer changes in species distributions. Coastlines 
act as barriers and locally cooler areas act as attractors for traject- 
ories, creating source and sink areas for local climatic conditions. 
Climate source areas indicate where locally novel conditions are not 
connected to areas where similar climates previously occurred, and 
are thereby inaccessible to climate migrants tracking isotherms: 
16% of global surface area for 1960 to 2009, and 34% of ocean for 
the ‘business as usual’ climate scenario (representative concentra- 
tion pathway (RCP) 8.5)* representing continued use of fossil fuels 
without mitigation. Climate sink areas are where climate conditions 
locally disappear, potentially blocking the movement of climate 
migrants. Sink areas comprise 1.0% of ocean area and 3.6% of land 
and are prevalent on coasts and high ground. Using this approach to 
infer shifts in species distributions gives global and regional maps of 
the expected direction and rate of shifts of climate migrants, and 
suggests areas of potential loss of species richness. 

By reorganizing natural systems’, climate change is poised to be one 
of the greatest threats to biodiversity of this century’, compromising 
the integrity, goods and services of living systems’. Increased under- 
standing of how species distributions and persistence are likely to be 
affected can inform effective conservation under climate change, as 
part of a range of considerations’. Predictions from complex models 
may incorporate ecological complexity but come with a high degree of 
uncertainty’®. A simpler approach is to consider the local speed and 
direction of shifting climate contours as an expectation of how species’ 
distributions would have to shift to track the location of their thermal 
niches®"®. This is the velocity of climate change®’: the temporal trend 
divided by the spatial gradient in a climate variable such as temperature® 
or precipitation’®”'. Landscapes and seascapes have different patterns of 
climate velocity’ and consequential residence times of climate, giving dif- 
ferent implications for species’ persistence and priorities for conservation’. 


Patterns of shifts in distributions of many taxa in the ocean have already 
been shown to follow the velocity of climate’. 

Here we use velocity-derived trajectories to indicate global regions 
susceptible to effects of geographical limits to climate-driven distri- 
bution shifts. Climate trajectories are paths that points on an isotherm 
will travel over specific periods (Fig. 1, see Methods for details), integ- 
rating spatially variable speeds and directions of climate velocity along 
the paths to show effects that static velocity maps cannot. Geographical 
limits to trajectories, either barriers such as coasts and mountains or 
lack of connections to cooler or warmer environments, suggest limits 
to climatic niche shifts and, by inference, species persistence at a local 
level or replacement from warmer environments (Figs 2 and 3). Velocity 
fields were derived for 1960-2009 for land and ocean surface tempera- 
tures’ on a 1° grid, allowing inference at a global scale, but sacrificing 
small-scale detail such as thermal minima on mountain tops or sharp 
gradients associated with ocean fronts”. 

We categorized types of trajectory behaviour using trajectory length 
and the percentages of trajectories starting in, ending in and passing 
through cells (Fig. 1 and Extended Data Fig. 1). Short trajectories indi- 
cated non- or slow-moving thermal niches. Cells were classed as rela- 
tive climate sinks if a high proportion of trajectories terminated there. 
Absolute climate sinks were also distinguished: coastal climate sinks 
where trajectories were blocked by coasts, and internal climate sinks 
where velocities in neighbouring cells converged. Cells were classed as 
climate sources if no trajectories ended there. Thereafter, cells with a 
high proportion of trajectories passing through were classed as corridors. 
Divergence cells were identified as those where fewer trajectories ended 
than started in that cell, and convergence cells if the opposite were true. 

Uncertainty evaluated by resampling of annual average temperature 
maps gave a ‘likely’ (>66%) consistency of designation of trajectory 
classes between bootstrap samples and the overall classification for 59% 
of ocean and 72% of land cells (Fig. 2 and Extended Data Figs 2-4). 
Consistency was <66% where spatial gradients and temperature trends 
are most uncertain, such as where inter- and multidecadal climate 
variability dominates as for the El Nifio-Southern Oscillation in the 
tropical Pacific. Classification of ‘very likely’ (>90%) was achieved for 
40% of land and 26% of ocean cells, mainly sources, coastal sinks and 
low velocity areas (Extended Data Fig. 4). 
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A Loss of climate migrants 
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A Little thermal shift 
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Figure 1 | Climate-change features emerging from the properties of climate 
velocity trajectories using the Australian landmass as an example. 

a, Trajectories (indicated by arrows) show predicted 50-year (1960-2009) shifts 
based on isotherm velocity. Features described by trajectory properties are 
shown; typical locations and physical characteristics are indicated on the map 
(stars denoting typical locations of sinks, sources, corridors, convergence and 


The proportions of land and sea areas classed as climate sources and 
sinks are similar at a global scale (sources: land 17.4%, ocean 15.9%; 
sinks: land 3.6%, ocean 1.0%), but the latitudinal pattern differs (Fig. 2 
and Extended Data Figs 5-9 for regional maps). For ocean surface 
temperatures, climate sources are concentrated within 10° latitude of 
the Equator (Figs 2b, d), a pattern not evident on land (Fig. 2a, c). 


oO Convergence and Ok) Divergence 


@ Appreciable temperature change 


A Little or no change 
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b Step 1 Trajectory length 
() 
<20 km 20-100 km >100 km 
(@) Non-moving © Slow-moving Step 2 
Step 2 


x) Coastal sink: trajectories blocked 


x) Internal sink: local temperature inflection 


Remaining cells: Step 3 
Step 3 


Trajectories ending (N,,,,4), flow-through (N,-;), 
starting (N.,) 


divergence, and slow- and non-moving areas). The respective implications of 
these physical characteristics for climate migrants are listed in the boxes 
(triangles). b, Hierarchical sequential classification of climate change features 
based on length of trajectories (step 1), geographical features (step 2) and 

the relative abundance of trajectories ending in, starting from and flowing 
through each cell (step 3). 


Topographic complexity on land generates more local warm and cold 
source and sink areas compared to the ocean. Importantly, 12.0% of 
land trajectories and 5.4% of ocean trajectories terminate in sinks, 
representing ‘lost’ local conditions: 6.1% (5.0%) land (ocean) ending 
in coastal sinks, 5.9% (0.4%) in internal sinks. These losses are anal- 
ogous to ‘disappearing climates’ but here result from local connections 


c 


Latitude 


i Sources 
i Corridors 


Non-moving Bi Divergence 


GB Slow-moving Convergence 


Figure 2 | Global patterns of climate trajectory classes. a-d, On land (a) and 
in the ocean (b), and proportional areas by latitude (c, d). Uncertainty in 
classification of areas is shown by the cross hatching on (a, c): <66% of 500 
bootstrap class maps consistent with underlying class map. Non-hatched areas 


Bi Sinks 


i Internal sinks 


Latitude 


100% 


BB Coastal sinks 0% 50% 


have classification that is ‘likely’ (>66% consistent). Uncertainty between 
convergence and divergence areas and slow and non-moving areas is not shown 
(see Extended Data Fig. 2). 
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7 — Figure 3 | Regional patterns of climate velocity trajectory classes for land 
para and sea surface temperatures. a, b, We show 50-year trajectories (shown by 
arrows) of climates (1960-2009) for Europe for 1° grid cells, overlaid on climate 


eT trajectory classes for ocean (a) and land (b). 


to thermal minima rather than global loss of combinations of clima- 
tic conditions. 

In the ocean, coastal sources form where poleward trajectories depart 
from coastlines, as in North Africa, and coastal sinks where equator- 
ward coastlines block trajectories, as in southern France (Fig. 3a). The 
opposite occurs on land to a degree: sources on equatorward coastlines 
and sinks along poleward coastlines. However, cooler, higher regions 

rt of continents, such as mountain ranges in Europe, ‘attract’ trajectories 
a towards the interior, disrupting the otherwise-poleward flow, resulting 
in internal sinks at greater elevations (Fig. 3b). Elevated land is also 
more likely to comprise non- or slow-moving areas. Corridors are 
evident in areas of convergent trajectories and high climate velocities, 
as in the northern North Sea and in southern Germany (Fig. 3a, b). 

Future climate trajectories for sea surface temperature (SST) using 
an example global climate model (ACCESS1.0) for 2006-2100 produce 
similar patterns of shifting climates to those seen globally for 1960- 
2009. Sources were indicated at thermal maxima around the equator 
(Fig. 4a, c). Greater warming in the RCP 8.5 ‘business as usual’ scenario® 
(2.95 °C increase in annual average SST for 2080-2099 over 1960-2009) 
results in longer trajectories than the RCP 4.5 scenario run (1.75 °C 
increase) and a doubling in size of areas identified as sources (RCP 8.5, 
34.1% of scenario cells as sources versus 19.8% for RCP 4.5). Local- 
scale patterns were also similar (Fig. 4b, d), as thermal gradients that 
determine trajectory direction are reproduced in the climate models. 
No-analogue climate futures’ will emerge in source areas with novel 
climates", particularly around the equator, but not in sources that result 
from coastal barriers disconnected from similar climates elsewhere. 

Each climate trajectory feature has different implications for the migra- 
tion patterns of climate-sensitive species through climate connectivity 
(Extended Data Table 1). Species richness in climate source areas may 
decline, because climate migrants leaving may not be replaced: sources 
lack connection routes for new migrants. Converging temperature 
isotherms in relative climate sinks may concentrate climate migrants, 
increasing local diversity, whereas absolute climate sinks represent cli- 
matic dead-ends where species cannot spread along thermal gradients 
into cooler areas, creating potential for local extinction. Large numbers 
of trajectories traversing a limited pathway suggest important corri- 
dors for climate migrants. 

Patterns of distribution shifts indicated by trajectory behaviour raise 
questions for ecology and conservation of climate-sensitive species’’, 
particularly when considered alongside the magnitude and latitude of 


ONon-moving MH Sources [Divergence i Sinks BB Coastal sinks 


HSlow-moving Hi Corridors L] Convergence @ Internal sinks 


Figure 4 | Global and regional 
patterns of 50-year climate 
trajectory classes based on trends 
from ensembles of global climate 
models for 2006-2100. a—d, Classes 
for two CMIP5 scenarios: the 4.5 W 
m_ ? (RCP 4.5) scenario (a, b) and 
the 8.5 W m 7 (RCP 8.5) scenario 
(c, d), derived from temperature 
trends and spatial patterns in 
temperature in data from the 
ACCESS1.0 CMIP5 global climate 
model. Arrows show the expected 
shift in location of points along 
isotherms over a 50-year period. 
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climate change’®. Long ‘climate residence times’ in areas of low velocity’ 
have been associated with high levels of historical endemism’’ and 
have led to such areas being proposed as genetic and evolutionary 
refugia'*. The non-replacement of climate migrants in climate source 
areas may result in net loss of species richness, and facilitate the estab- 
lishment of new species into abandoned niches’, such as in the eastern 
Mediterranean Sea via the Suez Canal’®. A larger number of inbound 
climate migrants in convergent areas, corridors and sinks implies that 
local communities should face greater reshuffling of species and novel 
ecological interactions”’, and compromised genetic diversity through 
gene swamping” but with increased adaptive gene flow”. Climate 
migrants face local extinction in climate sink areas, unless the species 
involved can adapt to changed conditions. 

Climate velocity is emerging as a good predictor for range shifts in 
the ocean*”, whereas the relationship is still to be investigated on land, 
although terrestrial species distribution shifts have been related to 
latitudinal shifts in isotherms”. The similarity in trajectory maps of 
future projections with past reconstructions suggests that the ecological 
implications and therefore the management actions for conserving 
biodiversity, as informed by current climate trajectories, could remain 
effective into the future. Species losses from both source and sink areas 
may accelerate with climate change, with greater warming resulting in 
areas of novel and lost climates, suggesting that impacts may be greatest 
in these areas. The approach developed here offers a rapid global method 
to quantify and map patterns of shifting thermal niches, and by implica- 
tion those species tracking thermal conditions, and highlights those 
areas of the globe that may be at risk from the effects of geographical 
barriers to climate migrants. 


METHODS SUMMARY 


Velocity fields were derived for 1960-2009 land and ocean surface temperatures 
using Hadley Centre HadISST v1.1 and Climate Research Unit CRU TS3.1 (ref. 7) 
data sets, and for global climate model (GCM) projections for 2006-2100 using 
example GCM model data from CMIP5 (Coupled Model Intercomparison Project 
Phase 5) experiments for RCP 4.5 and RCP 8.5 scenarios. Velocity was calculated 
by dividing the 50-year temperature trend by the spatial gradient in 50-year means, 
taking the direction from the spatial gradient®. Trajectories of climate were obtained 
by calculating displacements using local velocity. If trajectories hit a coastal barrier 
anda cooler or warmer cell was found locally then the trajectory continued towards 
that cell, else the cell was designated as a coastal sink. 

The collective behaviour of trajectories was obtained from 0.1°-spaced traject- 
ories projected over 50 years at 0.1-year intervals. Cells were classed (Fig. 1b) as: 
first, slow or non-moving where 50-year trajectories were less than 100 km; second, 
coastal and internal sinks, where trajectories halted on coastlines or converged towards 
a central point; third, five types based on proportions of trajectories starting from, 
ending in, and flowing through cells. These five types of cells are: sources, cells 
where no trajectories ended; sinks where many trajectories ended; corridors, cells 
with a high proportion of trajectories passing through; and divergence and conver- 
gence cells as those where fewer/more trajectories ended than started in that cell. 

Uncertainty of classification into areas based on trajectory behaviour was esti- 
mated using bootstrap resampling (” = 500) of the temperature data sets. Each 
bootstrap sample comprised a random selection with replacement of 50 annual 
mean temperatures from the original series, from which mapped temporal and 
spatial temperature trends, and thereby velocity, were calculated. 50-year trajectories 
based on these maps gave a bootstrap classification of trajectory areas. Consistency 
of types by grid cell was expressed as the percentage of cells in bootstrap maps that 
were the same type as the original classification. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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A suspension-feeding anomalocarid from the 


Early Cambrian 


Jakob Vinther!, Martin Stein’, Nicholas R. Longrich? & David A. T. Harper* 


Large, actively swimming suspension feeders evolved several times 
in Earth’s history, arising independently from groups as diverse as 
sharks, rays and stem teleost fishes’, and in mysticete whales”. How- 
ever, animals occupying this niche have not been identified from the 
early Palaeozoic era. Anomalocarids, a group of stem arthropods 
that were the largest nektonic animals of the Cambrian and Ordo- 
vician periods, are generally thought to have been apex predators”. 
Here we describe new material from Tamisiocaris borealis’, an ano- 
malocarid from the Early Cambrian (Series 2) Sirius Passet Fauna of 
North Greenland, and propose that its frontal appendage is special- 
ized for suspension feeding. The appendage bears long, slender and 
equally spaced ventral spines furnished with dense rows of long and 
fine auxiliary spines. This suggests that T. borealis was a micropha- 
gous suspension feeder, using its appendages for sweep-net capture 
of food items down to 0.5 mm, within the size range of mesozoo- 
plankton such as copepods. Our observations demonstrate that large, 
nektonic suspension feeders first evolved during the Cambrian explo- 
sion, as part of an adaptive radiation of anomalocarids. The presence 
of nektonic suspension feeders in the Early Cambrian, together with evi- 
dence for a diverse pelagic community containing phytoplankton”* 
and mesozooplankton””"®, indicate the existence of a complex pelagic 
ecosystem"! supported by high primary productivity and nutrient 
flux’, Cambrian pelagic ecosystems seem to have been more modern 
than previously believed. 


20 ss 


Figure 1 | T. borealis Daley and Peel, 2010, frontal appendages from Sirius 
Passet, Lower Cambrian, North Greenland. a, Isolated and relatively 
complete appendage, MGUH 30500 (Geological Museum at the University of 


T. borealis, from the Early Cambrian Sirius Passet fauna of North 
Greenland, has been described previously as a possible anomalocarid 
on the basis of a disarticulated frontal appendage®. New fossils not only 
substantiate the anomalocarid affinities of Tamisiocaris but also sug- 
gest that it was adapted to prey microphagously on mesozooplankton 
(please note the revised terminology of the group, see Supplementary 
Information). 

T. borealis is at present known from five isolated frontal appendages 
and two appendages associated with a head shield. Frontal appendages 
(Fig. 1) measure =120 mm in length, comparable in size to the later 
Anomalocaris canadensis, but the total size of the body is not known. 
As in other anomalocarids, the appendage consists of discrete, sclero- 
tized articles. All specimens are preserved with the ventral spines parallel 
to the bedding plane, and the articles show no evidence of distortion 
due to compaction. It is therefore assumed that the articles were trans- 
versely compressed, with an oval cross-section in life. The appendage 
consists of at least 18 articles, versus 14 in A. canadensis, for example. 
Articles are separated by triangular arthrodial membranes (Extended 
Data Fig. 2b, c). These extend almost to the dorsal margin of the append- 
age; ventrally, the membrane is 33-50% the length of the articles, sug- 
gesting a well-developed flexural ability. 

The appendage curves downward distally, with the strongest curv- 
ature around the second and third article. The first article is straight, 
and longer than the next three combined. It bears a single pair of ventral 


Copenhagen). b, Isolated appendage, preserving auxiliary spines in great detail, 
MGUH 30501. ¢, Detail of spine (boxed area in b). All specimens were 
photographed submerged in water with high-angle illumination. 
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spines near its distal margin, which are stout and angled backwards 
(Fig. 1a), as in A. briggsi’. The next 17 articles each bear pairs of long 
and delicate ventral spines inserted at the mid-length of the article. 
These are evenly spaced along the appendage about 5-6 mm apart. The 
spines diverge ventrally such that each pair forms an inverted V-shape. 
Unlike A. canadensis, in which longer and shorter spines alternate and 
taper distally, the ventral spines are all of similar length, measuring 26- 
27.5 mm along the full length of the appendage (Fig. 1a, band Extended 
Data Figs 1-3). A similar condition is seen in A. briggsi. The ventral spines 
curve posteriorly, again as in A. briggsi, but unlike in any other anom- 
alocarids. Individual spines appear flattened, with a median rod and 
thinner lamellar margins (Extended Data Fig. 1c). In addition, ventral 
spines are frequently kinked, and sometimes broken, suggesting that 
they were weakly sclerotized and flexible. 

As in many other anomalocarids™?’, the anterior and posterior mar- 
gins of the ventral spines bear auxiliary spines (Fig. 1c and Extended 
Data Figs 1c, 2d and 3), but they are unusually long in Tamisiocaris— 
measuring 4.2-5.0mm in length—and extremely slender. Auxiliary 
spines form a comb-like array, being spaced 0.3-0.85 mm apart, with 
a median spacing of 0.49 mm. The length and spacing are such that 
adjacent spine combs between spines would overlap or interdigitate. A 
three-dimensional digital reconstruction was produced (Fig. 2a) with 
the inferred relative proportions in place to infer its dexterity (Fig. 2b). 

One specimen consists of two associated appendages in subparallel 
orientation (Extended Data Fig. 4). Proximally, they join a large, ellip- 
tical head shield. The head shield is larger than in A. canadensis, but is 
not enlarged to the same degree as seen in Peytoia nathorsti and Hurdia 
victoria. Eyes are not preserved. 

The affinities of Tamisiocaris were examined in a cladistic analysis 
to explore its position within the anomalocarids. The analysis recovers 
a clade consisting of T. borealis and A. briggsi (Fig. 3). This clade, which 
we name the Cetiocaridae (cetus: whale, shark or other large marine 
animal; and caris: sea crab), is diagnosed by long, slender and recurved 
ventral spines, and the presence of numerous auxiliary spines. Tamisiocaris 
is more specialized, however, in having flexible ventral spines and densely 
packed auxiliary spines. The cetiocarids are a sister to Hurdiidae, a 
clade containing H. victoria, P. nathorsti and related species. Outside 
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these taxa lies a clade of plesiomorphic forms including A. canadensis, 
A. saron, Amplectobelua spp. and relatives. 

The hypothesis that T. borealis engaged in suspension feeding can 
be evaluated by comparisons with extant analogues (Extended Data 
Fig. 5). Suspension-feeding crustaceans, such as cirripedes (barnacles), 
atyid shrimp, copepods, cladocerans, mysids and euphausiaceans (rill) 
share a suite of adaptations for sieving particles out of the water column 
that are also found in the Cetiocaridae (Extended Data Fig. 5). These 
include appendages with very elongate, flexible setae and/or setules; 
regular spacing; and close spacing of setae/setules. These features cre- 
ate a net with a regular mesh size that efficiently traps all particles above 
a threshold set by the setal spacing. The feeding limbs sieve particles out 
of the water, concentrate them by contraction, and carry them to the 
mouth'®. The suspension-feeding apparatuses of vertebrates have a simi- 
lar morphology. Suspension-feeding teleosts and some sharks use a 
mesh formed by long, slender and closely spaced gill rakers. The feed- 
ing apparatus of mysticete whales consists of arrays of baleen plates 
that wear into elongate fringes’”. 

The mesh size of the capture apparatus is closely related to prey size: 
right whales specialize in feeding on small copepods (fringe diameter 
0.2 mm), whereas blue whales (fringe diameter 1 mm) feed on larger 
krill’®, A survey of diverse suspension feeders, from cladocerans to blue 
whales, shows a linear relationship between mesh size and minimum 
prey size (Fig. 4). Although larger prey can be captured, the bulk of the 
prey is close to the mesh size of the suspension apparatus. 

On the basis of the morphologies seen in modern animals, a suspension- 
feeding anomalocarid would be predicted to have evolved a setal mesh, 
with large appendages bearing long, flexible setae to increase capture 
area, with close, regular setal spacing. This is indeed the morphology 
observed in Tamisiocaris. Furthermore, the mesh dimensions can be 
used to predict the size of the prey caught by Tamisiocaris. Spacing of 
the auxiliary spines in T. borealis suggests that it could capture food 
items suspended in the water column down to 0.49 mm, whereas linear 
regression from extant suspension feeders (Fig. 4) predicts a slightly 
larger minimum particle size of 0.70 mm. Known mesozooplankton, 
from small carbonaceous fossil assemblages from the Cambrian Series 2 
(refs 9, 10), contain isolated feeding appendages from crustaceans, 


Figure 2 | A digital reconstruction of Tamisiocaris. a, Single appendage 
indicating the articulating membranes (Am), articles (Art), spines (Sp) and 


auxiliary spines (As). b, Possible sequence of movement of the frontal 
appendage of Tamisiocaris. See also Supplementary Videos 1 and 2. 
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Euarthropoda 
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Anomalocaris sp. Pioche Shale 
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Cetiocaridae 


Figure 3 | Phylogeny of anomalocarids. Strict consensus of 91 trees derived 
from an analysis of 31 taxa and 54 characters using parsimony in PAUP* 4.0 


including putative copepods. On the basis of comparisons with man- 
dibles of modern counterparts’°, the largest known specimens reached 
diameters of 1.5-2.7 mm. We suggest that feeding was accomplished 
by alternate sweeping of the appendages, with entrapped prey being 
sucked’? up by the oral cone (Supplementary Videos 1 and 2). 

In the context of the phylogenetic analysis presented here (Fig. 3), 
we see that anomalocarid clades evolved distinct frontal appendage mor- 
phologies and feeding strategies. Primitive forms such as A. canadensis 
had raptorial appendages with stout, trident-like spines, well-suited 
to impaling large, free-swimming or epifaunal prey’ (Extended Data 
Fig. 6a, b). Amplectobelua had pincer-like appendages~’ (Extended Data 
Fig. 6c, d) that would have been effective in seizing and tearing apart 
relatively large, slow-moving animals. In hurdiids, the appendages bear 
opposing pairs of spines, which may have functioned as jaws or in sed- 
iment sifting’ (Extended Data Fig. 6e, f). Finally, cetiocaridid frontal 
appendages are specialized as sweep nets (Extended Data Fig. 6g, h). 
This extraordinary range of appendage morphologies shows that, far 
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b10. T. borealis forms a clade with A. briggsi, here named Cetiocaridae. 
Numerical ages (left) are given as millions of years ago. 


from being a failed experiment, anomalocarids staged a major adaptive 
radiation during the Cambrian explosion, evolving to fill a range of niches 
as nektonic predators, much like the later radiations of vertebrates and 
cephalopods, including suspension feeders”"”””. 

The existence of suspension feeding in anomalocarids also has impli- 
cations for the structure of Early Cambrian pelagic food webs (Extended 
Data Fig. 7). It had been assumed that a diverse planktonic fauna and 
suspension-feeding animals did not evolve until the Late Cambrian” 
and thus the complexity of the pelagic food web evolved in a delayed, 
piecemeal fashion. However, the discovery of large suspension feeders 
in the Early Cambrian suggests a well-developed pelagic biota supported 
by high primary productivity and abundant mesozooplankton, because 
large animals can only exploit small prey when they exist at high densi- 
ties. Whales, whale sharks and basking sharks exploit highly productive 
areas such as upwelling zones and seasonal plankton blooms at high 
latitudes**. This general observation holds for all microphagous sus- 
pension feeders ranging from cladocerans, to anchovies, to red salmon, 


Figure 4 | Diagram depicting the 


Rainbow relationship between suspension 
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Bowhead consumed by suspension feeders. 
Sy Blue and red dots are minimum and 
maximum food particle size 
Whale respectively recorded for a given 
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the dotted line based on the average 
mesh width of 0.49 mm. The diagram 
is collated from a range of modern 
suspension feeders (see Methods). 
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to blue whales: a high density of food particles is required to sustain an 
actively swimming suspension feeder. 

Other evidence for high primary productivity in the Cambrian, such 
as vast deposits of phosphorites and increased terrestrial nutrient flux'”"*”°, 
imply that high productivity may have been a global phenomenon in 
this period. Furthermore, the Cambrian also witnessed a radiation of 
spiny acritarchs, which are thought to have lived as microscopic phyto- 
plankton, replacing larger Neoproterozoic benthic forms”*®. Complex 
minute crustacean feeding appendages also occur in Lower and Middle- 
Upper Cambrian rocks”’”’, demonstrating the presence of diverse meso- 
zooplankton, preying on phytoplankton. Abundant vetulicolians in Sirius 
Passet”° (with hundreds of specimens collected on recent expeditions) 
may also have been suspension feeding upon phytoplankton (Extended 
Data Fig. 7). One tier up, Tamisiocaris would have preyed upon the 
mesozooplankton, as would the common nektonic arthropod Isoxys 
volucris’’. Other pelagic predators known from Lagerstatten elsewhere 
would also have fed on mesozooplankton, including ctenophores, cni- 
darians, chaetognaths" and pelagic arthropods** (Extended Data Fig. 7). 
The Cambrian pelagic food web was therefore highly complex**”°, 
containing multiple trophic levels, including pelagic predators'' and 
multiple tiers of suspension feeders. This underscores the remarkable 
speed with which a modern food chain was assembled during the Cam- 
brian explosion. 

Finally, the discovery of a suspension-feeding anomalocarid has impli- 
cations for debates concerning the predictability of evolution, or lack 
thereof. One view holds that evolution is ultimately unpredictable*’. 
The notable convergence between Tamisiocaris and extant suspension 
feeders, however, suggests that although different groups occupy eco- 
logical niches at different times, the number of viable niches and viable 
strategies for exploiting them are limited. Furthermore, the derivation 
of the suspension-feeding Tamisiocaris from a large apex predator par- 
allels the evolution of suspension-feeding pachycormid fish'”", sharks 
and whales’. In each case, suspension feeders evolved from nektonic 
macropredators. This suggests that evolution is canalized not only in 
terms of outcomes, but in terms of trajectories. The result is that inde- 
pendent evolutionary experiments by animals as different as anoma- 
locarids, fish and whales have converged on broadly similar outcomes. 


METHODS SUMMARY 


Specimens were collected in the field and photographed in the laboratory, coated 
or uncoated and submerged in water. A digital reconstruction of the Tamisiocaris 
feeding appendage was made to infer the range of motions. The suspension mesh 
diameter and prey width were collected from the literature on extant suspension 
feeders to depict the linear relationship between these (see Methods). A cladistic 
analysis containing 31 taxa and 51 characters was collated and analysed in PAUP* 
4.0 b10 and TNT (see Supplementary Information). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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A primitive placoderm sheds light on the origin of the 


jawed vertebrate face 


Vincent Dupret', Sophie Sanchez”, Daniel Goujet?, Paul Tafforeau & Per E. Ahlberg! 


Extant vertebrates form two clades, the jawless Cyclostomata (lam- 
preys and hagfishes) and the jawed Gnathostomata (all other verte- 
brates), with contrasting facial architectures’”. These arise during 
development from just a few key differences in the growth patterns 
of the cranial primordia: notably, the nasal sacs and hypophysis orig- 
inate from a single placode in cyclostomes but from separate pla- 
codes in gnathostomes, and infraoptic ectomesenchyme migrates 
forward either side of the single placode in cyclostomes but between 
the placodes in gnathostomes* *. Fossil stem gnathostomes preserve 
cranial anatomies rich in landmarks that provide proxies for devel- 
opmental processes and allow the transition from jawless to jawed 
vertebrates to be broken down into evolutionary steps””"’”. Here we 
use propagation phase contrast synchrotron microtomography to 
image the cranial anatomy of the primitive placoderm (jawed stem 
gnathostome) Romundina’’, and show that it combines jawed verte- 
brate architecture with cranial and cerebral proportions resembling 
those of cyclostomes and the galeaspid (jawless stem gnathostome) 
Shuyu". This combination seems to be primitive for jawed vertebrates, 
and suggests a decoupling between ectomesenchymal growth tra- 
jectory, ectomesenchymal proliferation, and cerebral shape change 
during the origin of gnathostomes. 

One of the most distinctive features of cyclostome facial architecture 
is the median nasohypophysial duct, which forms when the posthy- 
pophyseal processes grow forward on either side of the nasohypophy- 
sial placode to create the hood (lamprey) or oronasal membrane plus 
tentacles (hagfish)**’. In gnathostomes, by contrast, these processes 
converge in the midline between the hypophysial and nasal placodes to 
create the trabeculae, while the maxillary process of the mandibular 
arch grows forward laterally to form the upper jaw'*”"*. Cyclostomes 
have very short forebrains and nasal sacs that lie well posterior to the 
upper lip of the mouth, whereas gnathostomes have longer forebrains 
and nasal sacs located above the jaw margin*’. The cyclostome hypo- 
physis projects ventrally into the nasohypophysial duct, whereas the 
gnathostome hypophysis projects posteroventrally towards the palate; 
the distance between hypophysis and nasal sacs is thus much greater in 
gnathostomes than cyclostomes. 

Comparative analysis of extant cyclostomes and gnathostomes alone 
does not allow us to polarize this transformation or break it down into 
steps. However, the lower part of the gnathostome stem group com- 
prises fossil jawless vertebrates with nasohypophysial ducts’*, showing 
that the cyclostome pattern precedes the gnathostome pattern and indi- 
cating that fossils can potentially illuminate the transformation itself 
(Fig. 1 and Extended Data Figs 1 and 2). The 430-million-year-old galeas- 
pid (jawless stem gnathostome) Shuyu seems to show an early transitional 
condition: nasal sacs and hypophysis are separated by a small trabecular 
process, but open into a common nasohypophysial duct"’. This suggests 
that the posthypophysial processes of Shuyu followed a lateral growth 
path as in cyclostomes, but that nasal and hypophysial placodes were 
separate, and that a small component of the premandibular ectome- 
senchyme grew into the midline above the hypophysial placode to form 
the trabecular process (Supplementary Information). 


Placoderms (extinct armoured jawed fishes) form the paraphyletic 
upper part of the gnathostome stem group”'®””. They may thus include 
the most primitive examples of jawed vertebrate morphology. Romundina 
stellina Orvig, 1975 (ref. 13) (Fig. 2 and Extended Data Figs 3 and 4), 
from the earliest Devonian period (410-419 million years ago) of Canada, 
is one of a small number of placoderms that differ from all other jawed 
vertebrates by having nasal capsules positioned between the eyes, well 
behind the upper lip. These ‘posterior-nosed’ forms, which also include 
Brindabellaspis and antiarchs, are consistently recovered among the 
phylogenetically deepest placoderms”'*’*'>"° (Fig. 1, Supplementary 
Information and Extended Data Figs 1 and 2). The position of the cap- 
sules suggests that they had short forebrains, but their endocranial anat- 
omy has remained largely unknown'’*’”"’. The only properly documented 
cranial cavity of a posterior-nosed form is that of Brindabellaspis, recon- 
structed from two acid-prepared braincases’*. Brindabellaspis has some 
puzzling features, such as nasal sacs located within the orbits’*, which 
make it difficult to assess whether its endocranial anatomy is repre- 
sentative for posterior-nose placoderms in general. 

Here we present the braincase of Romundina, reconstructed from 
a propagation phase contrast synchrotron microtomography (PPC- 
SRuCT) scan of specimen MNHN CPW1 (Fig. 2a, b and Supplemen- 
tary Video 1). The specimen is complete apart from an oblique ventral 
breakage and the absence of the rostronasal capsule. This capsule, com- 
prising the nasal capsules and their dermal bone cover, was separated 
from the braincase by an encircling optic fissure (Fig. 2a, b); it is known 
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Figure 1 | Phylogenetic position of Romundina. Simplified vertebrate 
phylogeny, based on the character matrix reported previously’* with the 
addition of Romundina, showing the distribution of different nasal 
architectures. Arrowheads indicate extant groups. For full analysis see 
Supplementary Information. 
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from other specimens of Romundina’*”° (Extended Data Fig. 4), allowing 
us to reconstruct its position and internal spaces (Figs 2c-fand 3, Extended 
Data Fig. 3c-f and Supplementary Video 2). The posterior part of the 
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Figure 2 | Endocranial anatomy of Romundina. a, Photograph of specimen 
MNHN CPW1 in left anterodorsolateral view. The red box indicates the area 
enlarged in b. b, Orbital region and snout in close-up view. c, Braincase and 
skull roof in left lateral view, modelled from PPC-SRuCT scan. Red box 
indicates region shown in d-f. d-f, Cranial cavity and associated structures in 
lateral (d), dorsal (e) and ventral (f) views. Outline of braincase shown in white. 
Rostronasal capsule, nasal sacs and olfactory bulbs are reconstructed from 
previously reported data'* (see also Supplementary Information). e and f both 
extend slightly across the midline so that midline structures such as nerve 0 and 
the hypophysial fossa are shown complete. Scale bars: a, 10 mm; b-d, 2 mm; 
e, f, 1mm. All images are original. 

braincase is broadly similar to those of other early placoderms’*"”7"- 
and will be the subject of a subsequent publication. We focus here 
on the anterior region, from the canal for the hyomandibular branch 
of the facial nerve (VII(hm)) to the tip of the snout (Fig. 2d-f and 
Extended Data Fig. 3d-—f), which differs radically from the correspond- 
ing region of an extant gnathostome. 

The telencephalic portion of the cranial cavity is extremely short 
(Figs 2c-e, 3). It is bounded anteriorly by the lamina cribrosa of the 
rostronasal capsule (Extended Data Fig. 4 and ref. 13), which carries on 
its posterior face a central pair of foramina for the terminal nerve (nerve 0) 
and two lateral concavities for the olfactory bulbs. Posterior to the optic 
foramen (nerve II), which straddles the optic fissure, the diencephalic 
portion of the cranial cavity passes down to the anteroventrally directed 
hypophysial recess. Large recesses for the nerve V ganglia mark the level 
of the pons. Nasal-hypophysial distance is much smaller than in an 
extant gnathostome (Fig. 3). Similar proportions characterize the cra- 
nial cavities of the posterior-nosed placoderms Brindabellaspis’* and 
Jagorina”’, the galeaspid Shuyu”, and the brains of extant cyclostomes’* 
(Supplementary Information). We conclude that cerebral proportions 
remained substantially unchanged through the transition from jawless 
to jawed cranial architecture, and that the characteristic elongate telen- 
cephalon and posteroventrally oriented hypophysis of extant gnath- 
ostomes evolved later (Fig. 3). 

The braincase of Romundina includes a large precerebral region and 
broad suborbital shelves that terminate caudally at the canal for the 
hyomandibular branch of the facial nerve (nerve VII(hm)) (Fig. 2 and 
Extended Data Fig. 3). The suborbital shelves carry a series of articula- 
tions for the palatoquadrate (known from detached examples’*) from 
the anterior end of the precerebral region to immediately in front of the 
VII(hm) nerve foramen. The spatial relationship of this region to the 
palatoquadrate and rostronasal capsule, and the fissure separating it 
from the latter, indicate that the precerebral braincase and suborbital 
shelves are formed by the trabeculae. In extant gnathostomes, the pos- 
terior part of the trabeculae consists of infraoptic ectomesenchyme, 
whereas the anterior part seems to contain varying components of supra- 
and infraoptic ectomesenchyme, depending on taxon”*”*; the nasal cap- 
sules are always composed of supraoptic ectomesenchyme. In Romundina, 
the sharp anterior demarcation and bounding fissure of the rostronasal 
capsule suggest that it formed the anteriormost part of the supraoptic 
ectomesenchyme and that the whole trabecular region was composed 
of infraoptic material (Supplementary Information and Figs 3 and 4). 

The braincase of Brindabellaspis resembles that of Romundina in 
having a long precerebral region’*. The partly preserved braincase of 
the antiarch Minicrania and attachment scars on the dermal skull bones 
of other antiarchs indicate a similar configuration’’”*. The rostronasal 
capsule was small and fissure bounded in antiarchs (as indicated by the 
dermal component), whereas in Brindabellaspis the capsule was fused 
to the braincase and the nasal sacs lay in the anterior corners of the 
orbits'”"*. We conclude that the anatomical interpretation and inferred 
ectomesenchymal map of Romundina are applicable to posterior-nosed 
placoderms in general. 

Although Romundina is an unambiguous jawed vertebrate, the pro- 
portions of its brain closely resemble those of jawless vertebrates. Further- 
more, its facial morphology suggests that the region formed from infraoptic 
ectomesenchyme was proportionately much larger than in an extant 
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gnathostome and projected anteriorly beyond the supraoptically derived 
rostronasal capsule (Figs 3 and 4). This resembles the condition in extant 
cyclostomes, where infraoptic material forms the bulk of the upper lip or 
oronasal membrane (Fig. 3). The phylogenetic position of Romundina 
and the similar morphology of other posterior-nosed placoderms suggest 
that these characteristics are primitive for jawed stem gnathostomes. 
Immediately crownward to Romundina, our phylogenetic analysis recov- 
ers the clade Arthrodira (Figs 1 and 3, Supplementary Information and 
Extended Data Figs 1 and 2). Primitive arthrodires such as Kujdanowiaspis'” 
and Dicksonosteus” have terminal rostronasal capsules, but their fore- 
brains are still very short. Comparison with Romundina (Fig. 3) indi- 
cates that this configuration was produced by shortening the trabecular 
region anteriorly. In the gnathostome crown group the telencephalon, 
trabecular region and supraoptic ectomesenchymal territory have all 
been lengthened substantially, but their relative positions and propor- 
tions remain similar to those of arthrodires (Fig. 3). 

The transformation from cyclostome to gnathostome facial architec- 
ture has been described as a heterotopic phenomenon, involving changes 
in ectomesenchymal growth trajectories and ectoderm-—ectomesenchyme 
interactions’. The data from stem gnathostomes support this interpre- 
tation but add evidence for the intermediate steps of the transformation 
and the decoupling of different processes (Fig. 3). Separate but closely 
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Figure 3 | Transformation of the vertebrate face 
and brain. a, Phylogeny of representative 
vertebrates, with diagrammatic sagittal sections of 
their heads. Heads are aligned on the exits of the 
optic nerve (II) and vagus nerve (X); cyclostomes 
are shown at a smaller scale because of their short 
brains and large precerebral regions. Bar diagram 
beneath each head shows the longitudinal position 
and extent of cranial components. Posterior 
boundary of supraoptic ectomesenchyme is 
positioned arbitrarily at the level of nerve II. 
Hagfish and lamprey are modified from ref. 7, 
Shuyu is based on data from ref. 11, with 
permission, all other images are original. 

C, cyclostome crown group node; G, gnathostome 
crown group node. Arrows indicate important 
evolutionary steps. b, Brains/cranial cavities of 
the same taxa, aligned on the optic nerve (II) and 
vagus nerve (X), all in lateral view except 
Kujdanowiaspis, which is shown in composite 
dorsal (top)/ventral (bottom) view because no 
lateral view is available. Diagrams of brains of 
hagfish, lamprey, Erpetoichthys are modified with 
permission from ref. 29; brain of Squalus is 
modified with permission from ref. 30; cranial 
cavity of Shuyu is modified with permission from 
ref. 11; cranial cavity of Kujdanowiaspis is modified 
from ref. 23. 
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spaced nasal and hypophysial placodes, associated with a short telen- 
cephalon, were established before the origin of jaws and retained an 
unchanged geometry across the transition from agnathans to jawed 
vertebrates. This shows that the heterotopic shift in ectomesenchymal 
growth trajectory, which replaced the nasohypophysial duct with a 
solid trabecular region, was not driven by a lengthening of the telen- 
cephalon, leading to wider separation of the nasal and hypophysial 
placodes. On the contrary, we suggest that the creation of a trabecular 
region contributed to making this lengthening possible. By forming a 
horizontal shelf between the nasal sacs and hypophysis, the trabeculae 
also became a skeletal floor for the telencephalic part of the cranial cavity, 
which could support and protect a long telencephalon. In cyclostomes 
the telencephalon has no ventral support’. Consonant with this hypo- 
thesis, telencephalic lengthening occurred independently at least twice 
among placoderms (in Macropetalichthys and pholidosteid arthrodires”*) 
as well as in crown gnathostomes. Even within the crown group, some 
early members such as the sarcopterygian Tungsenia” retain a relatively 
short telencephalon, although the nasal-hypophysial distance is greater 
than in Romundina or primitive arthrodires. 

When the trabecular region first evolved, its dimensions were sim- 
ilar to an agnathan upper lip (Figs 3 and 4), suggesting that the change 
of growth trajectory for the premandibular ectomesenchyme was not 
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Figure 4 | Inferred neural crest composition of Romundina. PPC-SRuwCT 
model of Romundina specimen MNHN CPW1 in left anterodorsolateral view, 
with reconstructed rostronasal capsule, showing inferred extent of 
premandibular infraoptic ectomesenchyme (pink) and supraoptic 
ectomesenchyme (blue). 


initially associated with a reduction in cell proliferation to produce a 
smaller structure; relative size reduction of the trabeculae, which pro- 
duced the spatial configuration characteristic of extant gnathostomes, 
came later. This implies that trajectories and proliferation patterns 
must have been regulated separately during the morphological trans- 
formation from jawless to jawed facial architecture. We predict that 
separate regulatory networks for these attributes are also present in 
extant vertebrates. 


METHODS SUMMARY 


Specimens of Romundina stellina were partly freed from the surrounding lime- 
stone matrix by dilute formic acid. One specimen was imaged with PPC-SRuCT at 
beamline ID19 of the European Synchrotron Radiation Facility (see Supplemen- 
tary Information and ref. 28 for technical specifications). Mimics v.12.3 and v.13.1 
(Materialise) were used for three-dimensional modelling. Pictures and animations 
were rendered in Maya 2012 (Autodesk). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Visual space is compressed in prefrontal cortex 


before eye movements 


Marc Zirnsak’?, Nicholas A. Steinmetz!, Behrad Noudoost!, Kitty Z. Xu! & Tirin Moore!? 


We experience the visual world through a series of saccadic eye move- 
ments, each one shifting our gaze to bring objects of interest to the 
fovea for further processing. Although such movements lead to 
frequent and substantial displacements of the retinal image, these 
displacements go unnoticed. It is widely assumed that a primary mech- 
anism underlying this apparent stability is an anticipatory shifting 
of visual receptive fields (RFs) from their presaccadic to their post- 
saccadic locations before movement onset’. Evidence of this pre- 
dictive ‘remapping’ of RFs has been particularly apparent within 
brain structures involved in gaze control’*. However, critically absent 
among that evidence are detailed measurements of visual RFs before 
movement onset. Here we show that during saccade preparation, 
rather than remap, RFs of neurons in a prefrontal gaze control area 
massively converge towards the saccadic target. We mapped the visual 
RFs of prefrontal neurons during stable fixation and immediately 
before the onset of eye movements, using multi-electrode record- 
ings in monkeys. Following movements from an initial fixation point 
to a target, RFs remained stationary in retinocentric space. However, 
in the period immediately before movement onset, RFs shifted by 
as much as 18 degrees of visual angle, and converged towards the 
target location. This convergence resulted in a threefold increase in 
the proportion of RFs responding to stimuli near the target region. 
In addition, like in human observers”*, the population of prefrontal 
neurons grossly mislocalized presaccadic stimuli as being closer to 
the target. Our results show that RF shifts do not predict the retinal 
displacements due to saccades, but instead reflect the overriding per- 
ception of target space during eye movements. 

We recorded from neurons within the frontal eye field (FEF) of mon- 
keys (Macaca mulatta) using linear electrode arrays (Fig. la and Methods). 
The FEF is an area of prefrontal cortex with a known involvement 
in gaze control’ and visual attention*"'’. Previous studies have found 
evidence that visual RFs of FEF neurons predictively remap before 
saccades". That is, this body of evidence suggests that FEF RFs shift 
from their presaccadic locations to their anticipated, postsaccadic loca- 
tions before the onset of each saccade (Extended Data Fig. 1). However, 
because these studies inferred RF shifts from visual responses to stimuli 
presented at only a few locations, the validity of the remapping frame- 
work remains uncertain. We therefore mapped the RFs of simulta- 
neously recorded FEF neurons with flashed (25 ms) ‘probe’ stimuli while 
monkeys performed a standard saccade task* (Methods and Extended 
Data Fig. 2). To obtain detailed measurements of RFs during the task, 
we used a dense array of visual probes: 10 X 9 positions covering an 
area of 36 X 32 degrees of visual angle (dva) (Fig. 1b). Using this arrange- 
ment, we mapped RFs during three separate periods: during fixation 
at each of two fixation points (FP1 and FP2) and just prior (69 ms, 
s.d. = 35) toa saccade from FP1 to FP2. Figure 1c shows four examples 
of FEF RFs mapped during fixation at FP1 and FP2, and their corres- 
ponding RF centres (RF1s and RF2s) (Methods). In each example, the 
change in fixation from one location to the other was accompanied by 
a RF displacement that was equivalent to the displacement of fixation, 
reflecting the retinocentric property of FEF RFs. RFs measured at FP2 


also served as empirical estimates of the expected shifts due to pre- 
dictive remapping. When measured while monkeys were still fixating 
at FP1, but preparing saccades to FP2, RFs were considerably different 
from those measured during stable fixation. As shown in the examples, 
presaccadic RF centres (PRFs) differed both from the RF1s and the RF2s 
(Fig. 1d). Furthermore, PRFs tended to be much closer to the saccade 
target (FP2), in some cases shifting from their RF1 location in a direc- 
tion orthogonal (Fig. 1c, example 3) or opposite (Fig. 1c, example 4) to 
the saccade direction, and thus were inconsistent with remapping. 

We measured the changes in RF2s and PRFs from their corresponding 
RF 1s (Agyx and Apag, respectively) for a population of 179 RFs mapped 
in two monkeys (Fig. 2a and Methods). As expected, for a fixed saccade 
vector, the average displacement of RF2s (11.2 dva, sd. = 1.5) was approx- 
imately equal to the saccade amplitude (12 dva, s.d. = 0.2), and was 
independent of the RF1 distance to the saccade target (é’ gr) (r = 0.05, 
P= 0.49) (Methods). Without presaccadic RF modulation, we would 
expect PRFs to be identical to their corresponding RF1s, given that ret- 
inal stimulation is essentially identical in these two conditions. How- 
ever, FEF RFs are considerably altered during saccade preparation, 
and are shifted by an average of 8.4 dva (s.d. = 3.6) and as much as 
18 dva (Methods and Extended Data Fig. 3). This shift was greater 
(P<10 "°) than the small variations in RFs measured between the fixa- 
tion conditions (Extended Data Fig. 4). Furthermore, in contrast to RF2s, 
PRFs depended on the RF1 distance to the saccade target (r = 0.44, 
P<10 ”), with larger PRF shifts occurring for more distant RFs. Note 
that this dependence should not occur if the presaccadic RFs predic- 
tively shift from RF 1s to their postsaccadic RF2s. Moreover, the angular 
differences (#') between the RF2 displacement vectors and the PRF shift 
vectors were not uniform as expected with remapping, but depended 
on the angular deviation (0) of RF1 from the saccade vector (r = 0.75, 
P<10 1) (Fig. 2b). Thus, we observed substantial presaccadic shifts 
that were inconsistent with the remapping prediction (Extended Data 
Fig. 5). Furthermore, the overall pattern of RF shifts reveals how indi- 
vidual shifts can appear consistent with remapping at some locations in 
space (Methods and Extended Data Fig. 6). 

To further understand the presaccadic RF modulation, we examined 
PREF shift vectors across the entire distribution of measured RF loca- 
tions (Fig. 2c, Methods and Extended Data Fig. 7). We observed that 
PRFs shifted in the direction of the saccade target independent of their 
corresponding RF1 location. On average, PRFs were 6.1 dva closer to 
the saccade target than the RFls (P< 10 1°). We further compared 
PRFs to the remapping prediction, using the RF2s as an empirical esti- 
mate, and found PRFs to be closer to the saccade target than expected 
with remapping (P< 10 '°). Thus, PRFs deviated from the remapp- 
ing prediction and instead converged towards the saccade target. This 
pattern of results was the same when considering only single neurons, 
indicating that the convergence reflected the shifting of individual neu- 
ronal RFs rather than a differential gain change across multiple neurons 
(Methods and Extended Data Figs 8 and 9). 

We next examined how presaccadic changes in RFs altered the rep- 
resentation of visual space by the population of recorded FEF neurons. 
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Figure 1 | Multi-electrode recordings and high-density mapping of FEF 
neuronal RFs. a, The FEF in the macaque cortex (left) and in a coronal 
magnetic resonance image from monkey N (middle). Right, linear array 
microelectrode and traces of FEF visual responses recorded simultaneously 
across 16 electrode contacts. b, FEF RFs were mapped with a 10 X 9 array 
(36 X 32 dva) of probe stimuli (squares) flashed during fixation at FP1 and FP2 
and immediately before saccades from FP1 to FP2. c, Four example neuronal 
RE maps. Left, mean peri-stimulus response histogram for the most effective 


Rather than focus solely on RF centres, however, we instead measured 
how saccade preparation altered the degree to which each visual probe 
elicited responses from the neuronal population (Methods). Figure 3a 
shows the percentage of ‘population RFs’ yielding responses across all 
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probe location during fixation at FP1 (top) and FP2 (bottom). Blue traces and 
circles indicate eye position and fixation location, respectively. d, Presaccadic 
RF maps of the same neurons mapped immediately before saccades from 
FP1 to FP2. Left, mean peri-stimulus response histogram for the most effective 
probe location. Probe presentation (vertical grey shading) was completed 
before saccadic onset on all trials. Bottom, measurement of changes in RF 
centres (black crosses, RF1, RF2 and PRF) during fixation (grey vectors) and 
saccade preparation (gold vectors). 


probe locations during fixation and saccade preparation. These distri- 
butions of ‘RF density’ illustrate how the representation of visual space 
is displaced by the change in fixation from FP1 to FP2 in the popu- 
lation. Moreover, these RF density distributions revealed a substantial 


Figure 2 | Shifting of FEF neuronal RFs before 
saccade onset. a, The presaccadic shift amplitude 
(Aprg) as a function of the distance of RF1 from the 
saccade target (FP2) (¢’ pp) for 179 FEF RFs. 

The distance between RF1 and RF2 (Apgyx) as a 
function of é’ gg, is shown for comparison. Lines 
denote linear regression fits. b, The angular 
deviation of the presaccadic RF shift from the 
remapping prediction (#') as a function of 0. 

c, Comparison of the population of presaccadic RF 
shifts (gold vectors) with the remapping prediction 
(grey vectors). Vectors were averaged within the 

6 X 6 dva spatial bins depicted in the lower right 
along with the distribution of individual RF 1s. 
Top right plot shows the distances between the PRF 
and the saccade target compared with the distances 
between RF2 and the saccade target. Black line 
denotes the line of unity. Vector origins and end 
points are based on, respectively, RF1 and RF2 and 
RF1 and PRF. 
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Figure 3 | Fixation and presaccadic population RF densities of recorded 
FEF neurons. a, Panels plot the percentage of population RFs responsive for a 
given probe location. Blue circles denote the location of fixation. b, Panels show 


effect of saccade preparation on the representation of space. Specif- 
ically, we observed an increase in RF density centred on the saccade 
target. Within a 20 X 20 dva region around the saccade target, RF den- 
sity increased by more than twofold (106.4%, P< 10 *). Note the more 
than threefold increase in the proportion of RFs with responses at 
locations nearest the target (Fig. 3b). These changes in RF density were 
accompanied by alterations in the presaccadic spike count correlations of 
simultaneously recorded neurons (Methods and Extended Data Fig. 10), 
suggesting changes in effective connectivity'** during saccade prepa- 
ration. The above observations point towards a substantial enhance- 
ment in the representation of visual space at the saccade target. 
Finally, we considered whether the presaccadic convergence of RFs 
to the saccade target results in a distorted ‘read-out’ of stimulus loca- 
tion. In human observers, stimulus location during saccade execution 
is grossly misjudged. This mislocalization results in a ‘compression of 
visual space’, with observers reporting stimuli as appearing much closer 
to the saccade target than they actually are**. Therefore, we decoded 
probe positions from the full population of recorded FEF neurons dur- 
ing fixation at FP1 (Fig. 4a) and during saccade preparation (Fig. 4b) 
(Methods). During fixation, the location of probe stimuli near FP1 
(x = [-10, —2], y = [—10, 6]), where most of the RFs were sampled, 
could be accurately decoded from the population response (average error 
= 1.3 dva). By contrast, during saccade preparation, probe locations 
within the same region of space were grossly mislocalized by the popu- 
lation of neurons (average error = 7 dva). Furthermore, the error was 
systematic in that the population response consistently mislocalized 
probes as being closer to the saccade target (Fig. 4c). The distance between 
the population estimate of probe location and the saccade target was 
reduced by 47% compared to fixation (—3.7 dva, P< 10°) and by 
46% compared to veridical (—3.6 dva, P< 10“). Thus, the convergence 
of RFs resulted in a compression of visual space towards the target. 
Predictive remapping of RFs is widely assumed to be the mechanism 
by which perceptual stability is achieved during saccades’, specifically 
by a global anticipatory updating of visual space. Although our results 
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Figure 4 | Localization of probe stimuli by the population of recorded FEF 
neurons. a, Population estimate of probe locations within a 16 X 16 dvaarea of 
visual space during fixation at FP1. Vector origins denote the veridical probe 
location and the end points denote the average population estimate of that 
location. Vector lengths denote the errors, measured as the average Euclidean 
distance to the veridical probe location. b, Population estimate of probe 
locations during saccade preparation to FP2. c, Reconstruction of probe 
stimulus grid from population estimates during fixation (top, dark grey) and 
during saccade preparation (bottom, gold). 
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demonstrate robust presaccadic shifts of FEF RFs, those shifts clearly 
violate the remapping prediction, and instead reveal a compression of 
visual space towards the saccade target. This observation raises an impor- 
tant question about the role of RF shifts in maintaining stability across 
eye movements. Specifically, how, if at all, might the convergence of 
RFs, rather than remapping, contribute to stable vision? It has been 
proposed by some that visual stability may be due to a strong bias of 
perceptual processing towards the targets of saccades. This hypothe- 
sis argues that the failure to detect retinal image displacements results 
from a reduced representation of non-target locations, as compared to 
the overriding perception of target space’’. Consistent with this hypo- 
thesis is psychophysical evidence of enhanced perception at the saccade 
target before movement onset’®"’, as well as enhanced visual cortical 
signals'*", Furthermore, the perception of visual space is massively 
compressed before saccades*®. Our results reveal a neuronal correlate 
of these perceptual effects. In particular, we found that populations of 
FEF neurons grossly mislocalize stimuli as being closer to the target, 
resembling psychophysical compression. Thus, regardless of the role 
of the above perceptual phenomena in visual stability, the representa- 
tion within the FEF mirrors them. 

FEF neurons have been causally implicated in the control of visual 
attention® and the corresponding modulation of stimulus-driven activity 
in posterior visual cortex”'°’?*. Several recent studies suggest that the 
influence exerted by FEF neurons on visual cortex during attention 
originates from predominantly visual signals''”°. Our results indicate 
that FEF visual signals conveyed to posterior areas before saccades grossly 
over-represent the space occupied by the target. Thus, before each eye 
movement, or during covert attention*®”’, feedback from FEF neurons 
may impose the same distortion onto visual cortex”*””, and this biased 
representation of target space could result in the aforementioned atten- 
tional enhancement within that space. 


METHODS SUMMARY 


Allexperimental procedures were in compliance with the US Public Health Service 
policy on the humane care and use of laboratory animals and the Stanford Uni- 
versity Animal Care and Use Committee. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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The E3 ligase Cbl-b and TAM receptors regulate 
cancer metastasis via natural killer cells 


Magdalena Paolino!, Axel Choidas?, Stephanie Wallner’, Blanka Pranjic', Iris Uribesalgo’, Stefanie Loeser', Amanda M. Jamieson‘, 
Wallace Y. Langdon”, Fumiyo Ikeda', Juan Pablo Fededal, Shane J. Cronin!, Roberto Nitsch', Carsten Schultz-Fademrecht?, 

Jan Eickhoff*, Sascha Menninger”, Anke Unger”, Robert Torka°, Thomas Gruber?, Reinhard Hinterleitner?, Gottfried Baier?, 
Dominik Wolf*”, Axel Ullrich®, Bert M. Klebl? & Josef M. Penninger! 


Tumour metastasis is the primary cause of mortality in cancer patients 
and remains the key challenge for cancer therapy’. New therapeutic 
approaches to block inhibitory pathways of the immune system have 
renewed hopes for the utility of such therapies”. Here we show that 
genetic deletion of the E3 ubiquitin ligase Cbl-b (casitas B-lineage 
lymphoma-b) or targeted inactivation of its E3 ligase activity licenses 
natural killer (NK) cells to spontaneously reject metastatic tumours. 
The TAM tyrosine kinase receptors Tyro3, Axl and Mer (also known 
as Mertk) were identified as ubiquitylation substrates for Cbl-b. 
Treatment of wild-type NK cells with a newly developed small mole- 
cule TAM kinase inhibitor conferred therapeutic potential, efficiently 
enhancing anti-metastatic NK cell activity in vivo. Oral or intraper- 
itoneal administration using this TAM inhibitor markedly reduced 
murine mammary cancer and melanoma metastases dependent on 
NK cells. We further report that the anticoagulant warfarin exerts 
anti-metastatic activity in mice via Cbl-b/TAM receptors in NK cells, 
providing a molecular explanation for a 50-year-old puzzle in cancer 
biology’. This novel TAM/CbI-b inhibitory pathway shows that it 
might be possible to develop a ‘pill’ that awakens the innate immune 
system to kill cancer metastases. 

Genetic ablation of Cb/-b or functional inactivation of its E3 ligase 
activity in mice results in CD8* T-cell-mediated rejection of primary 
tumours in several different models*®. In a control experiment sub- 
cutaneous injection of TC-1 tumour cells into Cbl-b-’~ Rag2"’~ mice, 
which lack B and T cells, caused a significant delay in tumour growth 
compared to CbI-b*’* Rag2’~ littermates (Fig. 1a), suggesting that 
innate immune cells are also involved. Immunohistochemistry revealed 
increased numbers of NK cells infiltrating the tumours of Cbl-b~’~ 
mice (Extended Data Fig. 1a). Cbl-b is expressed in murine and human 
NK cells; expression levels were not altered in NK cells from E3 ligase- 
defective mice (C373A*“”"), which carry a cysteine to alanine knock-in 
(KJ) mutation at position 373 (ref. 7) (Fig. 1b and Extended Data Fig. 1b). 
Loss of Cbl-b expression (CbI-b~’—) or the C373A5' mutation had 
no overt effects on NK cell development (Extended Data Fig. Ic, d). 

CbI-b-‘~ and C373A*"“' NK cells exhibited significantly increased 
proliferation and interferon (IFN)-y production when activated in vitro 
and were also more efficient in killing the prototypic NK cell target 
tumour cell line RMA-S in vivo (Fig. 1c, dand Extended Data Fig. le-)). 
In contact with YAC-1 target tumour cells, Cbl-b~/~ NK cells also 
displayed a higher capacity to kill, produce IFN-y, degranulate, secrete 
granzyme B, and to express higher levels of the cytotoxic mediator per- 
forin; knockdown of CBL-B in the human NK cell line NKL also resulted 
in enhanced cytotoxicity towards Jurkat cells (Extended Data Fig. 2a—h). 
NK cell immunodepletion using anti- NK1.1 antibodies and functional 
blockade of NKG2D (also known as Kirk1) receptors abolished anti- 
TC-1 tumour responses in Cbi-b-’~ and C373A! mice (Extended 


Data Fig. 3a—c). Moreover, subcutaneous B16F10 melanomas were slower 
growing in CbI-b-/~ and C373A“”' mice; depletion of NK1.1* cells 
reduced this increased survival of melanoma-bearing mice (Extended 
Data Fig. 3d-i). Thus, Cbl-b, via its E3 ligase activity, negatively regulates 
NK cell functions and controls NK-cell anti-tumour responses. 

We next analysed whether the absence of Cbl-b can potentiate the 
anti-metastatic activity of NK cells. Three weeks after intravenous B16F10 
melanoma challenge, Cbl-b-/~ and C373A*"' mice exhibited reduced 
lung metastases and increased survival (Fig. le and Extended Data Fig. 4a-e). 
Immunodepletion of NK1.1* cells caused uncontrollable tumour growth 
in all Cbl-b*’*, Cbl-b*’”, Cbl-b-’~ and C373A*”"' mice (Fig. 1f, g). 
NKG2D blockade in Cbl-b~’~ and C373A*”' mice also prevented the 
reduction oflung melanomas (Extended Data Fig. 4f-i). Of note, B16F10 
melanomas by themselves do not express the NKG2D ligand Rael, 
suggesting that this ligand is expressed on the tumour microenviron- 
ment. Cbl-b-/~ and C373A*”“' mice also exhibited significantly reduced 
metastases to extrapulmonary organs (Extended Data Fig. 5a—d). In the 
absence of NK cells, all Cbl-b*/*, Cbl-b*’~, Cbl-b-’~ and C373A*/! 
mice displayed secondary metastases in multiple organs (Fig. 1h, i), 
even at a lower tumour dose (Extended Data Fig. 5e-g). Immunodepletion 
of CD8™ T cells had no overt effect on the anti-metastases response of 
Cbi-b~/~ and C373Ae™™' mice (Extended Data Fig. 5h, i). When back- 
crossed to perforin mutant mice (Prf1-’~), Cbl-b-’~ Prfl~’~ double 
mutants were unable to reduce melanoma metastases (Fig. 1j, k and 
Extended Data Fig. 5j). Cbl-b has been implicated in anergic responses 
of natural killer T (NKT) cells*, a cell type that also expresses NK1.1 
(ref. 9). To provide definitive proof that Cbi-b mutant NK cells directly 
reject metastatic melanomas, we set up a therapeutic transplantation 
model. Significantly fewer B16F10 metastases were observed in the 
lungs of wild-type mice that received Cbl-b~/~ or C373A*"' NK cells, 
compared to untreated mice and mice receiving Cbl-b*’* NK cells 
(Fig. 2a—e). Thus, NK cells lacking Cbl-b, or defective in its E3 catalytic 
activity, are sufficient to inhibit the progression of lung melanomas and 
distant metastases. 

To extend our findings to a spontaneous tumour metastasis model, 
Cbl-b mutant animals were crossed to NeuT* transgenic mice that 
develop metastatic breast tumours’’. Although the onset of palpable 
mammary tumours was comparable, loss of Cbl-b markedly suppressed 
the growth of mammary tumours resulting in significantly prolonged 
overall survival (Extended Data Fig. 6a-d). Although the number of 
metastatic lung tumour foci was comparable between CbI-b*’~ NeuT* 
and Cbl-b~’~ NeuT™ mice, Cbl-b~’~ NeuT* mice showed a significant 
reduction in the tumour-to-lung area (Fig. 2fand Extended Data Fig. 6e-g). 
In Cbl-b-‘’~ NeuT* mice we observed a significant accumulation of 
NK cells within the metastatic foci (Fig. 2g and Extended Data 6h, i); 
depletion of NK cells with anti-asialo GM1 antibodies markedly enhanced 
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Figure 1 | CbI-b mutant NK cells control 
metastatic melanomas. a, TC-1 tumour growth in 
Chi-b*’* Rag2-/~ and Cbl-b-’~ Rag2‘~ mice 
(mean + s.e.m., 1 = 10 each). ***P < 0.001 (two- 
way analysis of variance (ANOVA), Bonferroni’s 
post-hoc test). b, Cbl-b and B-actin protein 
expression in CbI-b*’*, Cbl-b*/~ and Cbl-b-'~ 
NK cells. ¢, IEN-y* Cbl-b’* and Cbl-b-/~ NK 
cells (%) following anti-NKG2D stimulation 
(mean = s.e.m., 1 = 3). ***P < 0.001 (two-way 
ANOVA, Bonferroni’s post-hoc test). d, In vivo 
NK cell cytotoxicity towards RMA-S cells 

(mean + s.e.m., n = 16/10/15/14). *P < 0.05, 
**P < 0.01 (Student’s t-test). e, f, Representative 
lung melanoma metastases in control (e) or 
NK.1.1-depleted Cbl-b*/*, Cbl-b*/~, Cbl-b~/— 
and C373A"*! mice (f) at day +21. g, BI6F10 
tumour-to-lung ratios (mean + s.e.m.) of control 
and NK1.1*-depleted Cbi-b*’* (n = 12/6), Cbl- 
b*’” (n= 6/4), Cbl-b /~ (n = 9/4) and C373AK”! 
(n = 7/4) mice. ***P < 0.001 (Student’s t-test). 

h, Representative B16F10 extrapulmonary 
metastases in a NK1.1* cell-depleted mouse. 

i, Extrapulmonary metastases in control or NK1.1- 
depleted mice (lines are median, day + 16-21) 
**P < 0.01, ***P < 0.001 (Mann-Whitney 
U-test). j, k, Representative B16F10 lung 
metastases (j) and tumour-to-lung ratios 

(k) (mean + s.e.m., day +21) in Cbl-b*/— Prfi iui 
Cbl-b-/— Prfl + Cbl-b'” Prfl-’~ and Cbl- 
b-/~ Prfl~’~ mice. n = 5/7/8/8. **P < 0.01, 

***P < 0.001; NS, not significant (one-way 
ANOVA, Tukey’s post-hoc test). 


Figure 2 | Therapeutic anti-tumour activity of 
CbI-b mutant NK cells. a—d, Representative lung 
metastases (a), tumour-to-lung ratios (b), numbers 
of tumour foci (c), prevalence (d) and numbers 
of extrapulmonary metastases (e) at day +14 in 
C57BL/6] (B6) mice untreated or adoptively 
transferred with Cbi-b*/*, CbI-b~’~ and 
C373A*"' NK cells. For b and c, n = 6/6/6/7, 

*P < 0.05, **P < 0.01 (Mann-Whitney U-test). 
For d and e, n = 16/22/14/15, *P < 0.05 (Chi- 
squared test in d and Mann-Whitney U-test in e). 
f, g, Metastases (f, haematoxylin and eosin, 
asterisk) and NKp46* NK cells (g, arrows) in lungs 
of Cbl-b*’” NeuT* and Cbl-b “ NeuT* mice. 
n= 17/20. h, i, Representative haematoxylin and 
eosin-stained lung metastases (h, arrows) and 
number of metastatic foci (i) in control and 
anti-asialo GM1-treated Cbl-b ’ NeuT™ mice. 
n= 11/10. *P < 0.05 (Mann-Whitney U-test, 
lines are median). Scale bars, 50 1M (f, g) and 
2,000 11M (h). 
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metastases of mammary tumours into the lungs of Cbl-b-’~ NeuT* 
and Cbi-b*’~ NeuT* mice (Fig. 2h, i and Extended Data Fig. 7a-e). 
Deletion of both or even one Prf] allele again abrogated the capacity of 
Cbl-b-’~ NeuT* mice to control mammary metastases to the lung 
(Extended Data Fig. 7f-h). Thus, Cbl-b-deficient NK cells can also con- 
trol metastatic tumours in oncogene-driven mammary cancer. 

To identify ubiquitylation substrates, we performed in vitro ubiqui- 
tylation reactions on 9,000 human proteins. The transmembrane receptor 
tyrosine kinase TYRO3 was identified as the top candidate for CBL-B- 
mediated ubiquitylation (Extended Data Fig. 8a). Tyro3, together with 
Axl and Mer, constitute the TAM family of cell-surface tyrosine kinase 
receptors’. In additional in vitro assays using recombinant proteins, 
we confirmed that CBL-B ubiquitylates all three TAM family proteins 
(Fig. 3a and Extended Data Fig. 8b). Activation of HeLa cells with GAS6, 
a known ligand for TAM receptors”, resulted in recruitment of CBL-B 
to TAM receptors and AXL ubiquitylation; CBL-B knock-down using 
short interfering RNA (siRNA) reduced GAS6-induced AXL ubiqui- 
tylation (Extended Data Fig. 8c, d). Thus, TAM receptors are molecular 
substrates for CBL-B ubiquitylation in vitro and in vivo. 

In contrast to TAM triple-mutant mice”’, loss of Cbl-b had no overt 
effect on NK cell development (Extended Data Fig. 1c, d). The absence 
of Cbl-b, or its enzymatic E3 ligase activity, also had no apparent effect 
on basal Tyro3, Axl or Mer surface expression in splenic NK cells 


> 


(Extended Data Fig. 8e). Because TAM receptors can inhibit macro- 
phage and dendritic cell activation’, we proposed that TAM receptors 
might also negatively regulate NK cells. Gas6 indeed suppressed pro- 
liferation and IFN-y production of NKG2D-activated NK cells; Cbl- 
b-’~ NK cells were resistant to this inhibition (Fig. 3b and Extended 
Data Fig. 8f). Mechanistically, whereas Gas6 stimulation resulted in 
downregulation of all three TAM receptors from the cell surface, this 
downregulation was impaired in NK cells lacking Cbl-b (Fig. 3c). Impaired 
receptor downregulation was confirmed in HeLa cells using siRNA- 
mediated CBL-B knockdown (Extended Data Fig. 8g, h). Thus, Cbl-b/ 
TAM receptors constitute an inhibitory pathway for NK cell activation. 

To assess whether modulation of this pathway could be used for 
anti-tumour immunotherapy, we developed a highly selective TAM 
kinase inhibitor, termed LDC1267. LDC1267 preferentially inhibits 
Tyro3, Axland Mer at low nanomolarity, as determined by tracer-based 
binding assays (Fig. 3d, e). Selectivity of LDC1267 was further assessed 
using a cell-free KINOMEscan assay covering 456 kinases (Fig. 3f and 
Supplementary Table 1). Cellular selectivity was confirmed using a quant- 
itative chemical proteomics approach, cell-based phosphorylation and 
proliferation assays (Extended Data Fig. 9a-d). Treatment of NKG2D- 
activated NK cells with LDC1267 indeed abolished the inhibitory 
effects of Gas6 stimulation; LDC1267 had no apparent additional effect 
in Cbl-b-deficient NK cells (Fig. 3g). In vivo, wild-type mice treated 
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Figure 3 | TAM receptors are Cbl-b targets and characterization of a TAM 
receptor blocker. a, In vitro Cbl-b-dependent ubiquitylation of Tyro3, Axl and 
Mer (anti-Ub). Control, no TAM receptors. Loading controls are shown. 

b, IFN-y* Cbl-b*/~ and CbI-b-’~ NK cells (%) stimulated with anti-NKG2D 
antibodies in the presence of soluble Gas6. *P < 0.05 (one-way ANOVA, 
Dunnett’s post-hoc test, n = 6/5). c, TAM receptor surface expression in NK 
cells after stimulation with Gas6. *P < 0.05 (two-way ANOVA, Bonferroni’s 
post-hoc test, n = 10-13). d, Chemical structure of the TAM receptor kinase 
inhibitor LDC1267. e, Half maximal inhibitory concentrations (ICso) for the 
indicated protein kinases as determined by tracer assays. f, Remaining activity 
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(compared to dimethylsulphoxide (DMSO) control) in 456 kinases treated with 
1 uM LDC1267. g, IFN-y* Cbl-b*’~ and Cbl-b-/~ NK cells (%) pre-treated 
with vehicle or LDC1267 and then stimulated with anti- NKG2D antibodies 
and Gas6. *P < 0.05; NS, not significant (Student’s t-test, n = 11/8). h, In vivo 
NK cytotoxicity towards RMA-Rael cells in mice treated with vehicle or 
LDC1267. *P < 0.05, **P <0.01 (one-way ANOVA, Tukey’s post hoc test, 

n = 16/14/9/10). i, Tumour-to-lung areas in B16F10 tumour-bearing B6 mice 
untreated or adoptively transplanted with NK cells ex vivo-treated with vehicle 
or LDC1267. *P < 0.05 and **P < 0.01 (one-way ANOVA, Tukey’s post-hoc 
test, n = 16/16/16/10/10). Data in b, c, g-i are mean + s.e.m. 
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with LDC1267 showed enhanced cytotoxicity towards RMA cells over- 
expressing the NKG2D ligand Rae-1 (RMA-Rae1) to the same extent 
as C373A*"*' mice, but had no effect on the already enhanced NK 
cytotoxicity in Cbl-b-mutant mice (Fig. 3h and Extended Data 9e). To 
provide definitive proof that LDC1267 treatment can license NK cells 
to kill, we treated B16F10 melanoma-bearing mice with NK cells that 
were either untreated or treated ex vivo with our TAM inhibitor (Extended 
Data Fig. 9f). Adoptive transfer of LDC1267-treated wild-type NK cells 
significantly increased the anti-tumour response to levels observed in 
mice transplanted with CbI-b~‘~ NK cells, but did not increase the 
anti-metastatic efficacy of Cbl-b-knockout NK cells (Fig. 3i), reinfor- 
cing the notion that Cbl-b acts downstream of TAM receptors for NK 
cell inhibition. These results indicate that TAM receptor inhibition 
using LDC1267 unleashes NK cells to kill tumour cells in vivo. 

To test whether our novel TAM inhibitor can be used in an in vivo 
therapeutic setting, we challenged mice with B16F10 melanoma fol- 
lowed by intraperitoneal LDC1267 treatment. LDC1267 markedly 
reduced metastatic spreading of melanomas; NK1.1 depletion abolished 
the therapeutic benefits of LDC1267 (Fig. 4a—c). We next set-up an 
orthotopic metastatic breast cancer model, thatis, injection of 4T1 breast 
cancer cells into the mammary fat pad of BALB/c mice’*. LDC1267 
treatment significantly reduced the numbers and sizes of 4T1 micro- 
metastases in the liver, without any apparent effect on the primary 
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Figure 4 | Control of metastases by TAM receptor inhibition. a-c, Lung 
melanoma metastases (a), tumour-to-lung ratios (b, mean + s.e.m.) and 
extrapulmonary metastases (c, lines are median) in vehicle- and LDC1267- 
treated wild-type control or NK1.1-depleted mice. n = 8 each. *P < 0.05, 
*** P< 0.001; NS, not significant (Student’s t-test in b and Mann-Whitney 
U-test in c). d-f, Representative images (d, arrows; scale bars, 50 um), and 
numbers (e) and relative sizes (f) of 4T1 liver micro-metastasis in control or 
asialo-GM1 immunodepleted mice treated with vehicle or LDC1267 

(20 mgkg ', daily intraperitoneal injection). Mean + s.e.m., n = 7/9/8/8. 
*P < 0.05; NS, not significant (Mann-Whitney U-test). g, h, Representative 
lung B16F10 melanoma metastases (g) and tumour-to-lung ratios (h) in 
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mammary tumour (Fig. 4d-f and Extended Data Fig. 10a—-c). NK cell 
depletion using anti-asialo GM1 antibodies resulted in markedly enhanced 
4T1 liver metastases and abolished the therapeutic benefits of LDC1267 
(Fig. 4d-f). We finally tested whether LDC1267 could be also effective 
in the 4T1 liver metastasis model, when administrated orally (Extended 
Data Fig. 10d). Oral LDC1267 significantly reduced 4T1 liver micro- 
metastases (Extended Data Fig. 10e-g). Thus, using different model 
systems and different routes of administration, in vivo therapy with the 
TAM inhibitor LDC1267 consistently reduced metastases dependent 
on NK cells. 

Warfarin is the most widely used oral anticoagulant worldwide’® 
and has been known to reduce tumour metastases in mouse and rat 
models*’”. The underlying mechanisms were largely speculative’*”’. 
Besides its effects on coagulation, warfarin prevents y-carboxylation of 
TAM ligands, rendering Gas6 unable to activate TAM receptors”. We 
therefore proposed that warfarin could exhibit anticancer effects via 
modulation of the Gas6/TAM/Cbl-b pathway in NK cells. Administration 
of low-dose warfarin, which inhibits TAM receptor activity in vivo”! 
without affecting coagulation, markedly reduced melanoma metastases 
to the lungs and distal organs of Cbl-b*’* and CbI-b*’~ mice but had 
no apparent effect on CbI-b-defective mice; the absence of NK1.1* cells 
abolished the effects of warfarin on metastatic melanomas (Fig. 4g-j 
and Extended Data Fig. 10h-j). Moreover, warfarin treatment resulted 
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vehicle- and warfarin-treated (0.5mgl_') Cbl-b*/* (n = 12/12), Cbl-b*/— 

(n = 6/7), Cbl-b-’~ (n = 4/7) and C373Ax™™! (n = 8/8) mice. Mean + s.e.m. 
**P < 0.01, *P < 0.05; NS, not significant (Student’s t-test). i, j, Relative lung 
tumour area (i, mean + s.e.m.) and extrapulmonary melanoma metastases 
(j, lines are median) in Cbi-b*/* mice treated with vehicle (n = 15), warfarin 
(n = 14), vehicle and anti-NK1.1 antibodies (m = 5) or warfarin and anti- 
NK1.1 antibodies (n = 7). *P < 0.05, ***P < 0.001; NS, not significant 
(one-way ANOVA, Tukey’s post-hoc test for i; Mann-Whitney U-test for j). 
k, Schematic illustrating the TAM receptor/Cbl-b inhibitory pathway 
controlling NK cell anti-metastatic activities. 
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in enhanced in vivo NK cell cytotoxicity towards their prototypic tar- 
gets RMA-S and RMA-Rael (Extended Data Fig. 10k). Our data sup- 
port a coagulation-independent role of warfarin in tumour metastasis 
that relies on functional TAM receptors/Cbl-b in NK cells. 

Our experiments uncover a novel inhibitory role of the E3 ligase 
Cbl-b in the regulation of NK cell functions via TAM receptors (Fig. 4k). 
Releasing the inhibition imposed by this TAM/Cbl-b pathway renders 
NK cells capable of rejecting tumour metastases. There is high redund- 
ancy for TAM receptor functions in immune cells”. Because all three 
TAM receptors are expressed on NK cells’, they most likely, as they do 
in NK cell development’’, also exert redundant functions in NK cell 
activation. Indeed, genetic deletion of Tyro3 alone affected neither NK 
cell activation nor in vivo metastases control by NK cells (not shown). 
Mechanistically, upon ligand stimulation, Cbl-b is recruited to and 
ubiquitylates TAM receptors, which in turn appears to control TAM 
receptor internalization, corroborating the role of Cbl-b and its homo- 
logue c-Cbl in receptor tyrosine kinase endocytosis”. Of note, c-CBL 
has been implicated in the ubiquitylation of AXL in human epithelial 
HLEC cells**. Whether Cbl-b affects NK cell activation solely via TAM 
receptors and/or also directly impinges on other receptor systems needs 
to be determined. 

Current tumour immunotherapies face the challenge of severe side 
effects’®. As Cbi-b-knockout mice and mice treated with our TAM blocker 
do not exert any severe signs of inflammation or autoimmunity, there 
might be a therapeutic window for Cbl-b/TAM receptor inhibition to 
mediate tumour rejection without serious cytotoxicity. We also pro- 
vide a molecular explanation for the longstanding conundrum on how 
warfarin confers anti-metastatic activity, offering the possibility to re- 
assess the use of vitamin K antagonists such as warfarin in cancer 
therapy. Importantly, the anti-tumour effects of our TAM inhibitor 
provide a proof of principle that it might be possible to develop a ‘pill’ 
that awakens the innate immune system to kill cancer metastases. 


METHODS SUMMARY 


Cbl-b and C373A*”™ mice have been reported previously”””. Subcutaneous TC-1 
and B16F10 tumours were induced by injection of 2.5 X 10° tumour cells in the left 
flank of mice. For 4T1 tumours, 5 X 10° 4T1 cells were inoculated directly into the 
third mammary fat pad. Mammary tumours and lung metastases in NeuT* mice 
arise spontaneously’®. For melanoma metastases, 2.5 X 10° B16F10 cells were 
injected intravenously into the tail veins. Lungs were dissected for tumour quan- 
tification and the presence of metastases in any other organ was recorded. For 
adoptive transfer of NK cells, wild-type C57BL/6J recipient mice were inoculated 
intravenously with 2.5 X 10° B16F10 at day 0, and one day later were either left 
untreated or inoculated intravenously with 1 X 10° purified Cbl-b'’*, Cbl-b-/~ or 
C373A‘""! ex vivo-expanded NK cells. For blocking TAM receptors, mice were 
orally treated in their drinking water with low doses of warfarin (0.5 mg ml” '). The 
TAM small molecule kinase inhibitor LDC1267 was developed applying rationale 
design based on the homology modelling and known non-specific TAM receptor 
kinase inhibitors. To study the therapeutic potential of TAM inhibition, ex vivo- 
expanded and purified NK cells were treated for 2.5 h with LDC1267 (2.5 1M) or 
vehicle and intravenously injected into wild-type C57BL/6] recipient mice bearing 
B16F10 tumours. For 4T1 and melanoma metastases experiments, mice were treated 
with LDC1267 using daily oral gavage (100 mg kg‘) or intraperitoneal injections 
(20 mgkg~'). All tumours were quantified blindly on the basis of a histomorpho- 
metric analysis using the Definiens Developer XD image analyser software. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Transcription factor achaete-scute homologue 2 
initiates follicular T-helper-cell development 
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In immune responses, activated T cells migrate to B-cell follicles 
and develop into follicular T-helper (Ty) cells, a recently identified 
subset of CD4* T cells specialized in providing help to B lympho- 
cytes in the induction of germinal centres’. Although Bcl6 has been 
shown to be essential in T};-cell function, it may not regulate the 
initial migration of T cells’ or the induction of the Tpy program, as 
exemplified by C-X-C chemokine receptor type 5 (CXCR5) upregu- 
lation*. Here we show that expression of achaete-scute homologue 2 
(Ascl2)—a basic helix-loop-helix (bDHLH) transcription factor’—is 
selectively upregulated in T;,; cells. Ectopic expression of Ascl2 upre- 
gulates CXCR5 but not Bcl6, and downregulates C-C chemokine 
receptor 7 (CCR7) expression in T cells in vitro, as well as accelerating 
T-cell migration to the follicles and T;,;-cell development in vivo in 
mice. Genome-wide analysis indicates that Ascl2 directly regulates 
Tyy-related genes whereas it inhibits expression of T-helper cell 1 
(Ty1) and Ty17 signature genes. Acute deletion of Ascl2, as well as 
blockade of its function with the Id3 protein in CD4* T cells, results 
in impaired T};-cell development and germinal centre response. 
Conversely, mutation of Id3, known to cause antibody-mediated 
autoimmunity, greatly enhances T},-cell generation. Thus, Ascl2 
directly initiates T-cell development. 

The development of Ty; cells is initiated by and dependent on their 
movement out of the T-cell zone and into the B-cell follicle. This migra- 
tion process is regulated by upregulation of CXCR5 as well as down- 
regulation of both CCR7 and P-selectin glycoprotein ligand 1 (PSGL1)”*. 
Trx Cells have unique developmental regulation and Bcl6 was reported 
to be selectively expressed in Ty cells’. However, although Bcl6 poten- 
tiates Tpy-cell generation in vivo, recent data suggest that it may not 
regulate CXCR5 upregulation by activated T cells or their migration to 
B-cell follicles in vivo**. Hence, the transcriptional mechanisms under- 
lying initial Ty4-cell commitment remain unclear’. 

Recently, we observed that a bHLH-domain-containing transcrip- 
tion factor, Ascl2, was highly expressed in CXCR5"Bcl6" cells in com- 
parison with CXCR5 Bcl6— T cells‘. Interestingly, CKCR5‘ Bclo’° T 
cells also exhibited upregulation of Ascl2 messenger RNA expression“, 
suggesting that its upregulation may precede that of Bcl6. Moreover, 
the Ascl2 gene locus was marked with active chromatin marker tri- 
methylated histone H3 lysine 4 (H3K4me3) in Tpy and, to a much less 
extent, T};2 cells, but not in other T-cell subsets, whereas the other Tpy- 
regulating genes Bcl6 (refs 7-9), Maf '°, Batf ‘*’* and Irf4 (ref. 13) were 
uniformly associated with H3K4me3 in all T-cell subsets (Extended 
Data Fig. 1a). To validate these results, we sorted three subpopulations 
of cells (CXCR5- REP", CXCR5* REP" and CXCRS"REP™; REP, red 
fluorescent protein) from Bcl6-RFP reporter mice immunized with key- 
hole limpet haemocyanin (KLH) emulsified with complete Freund’s 
adjuvant (CFA) (Fig. la), and found that Ascl2 was highly expressed 


in Ty cells at both mRNA and protein levels (Fig. 1b and Extended 
Data Fig. 1b). Also, Ascl2 expression was closely correlated with that of 
CXCRS (Fig. 1b) and was higher in Tyy cells than in other T-cell sub- 
sets (Fig. 1c). In human T cells, expression of Ascl2 as well as CKCR5 
and Bclé was found in human tonsil CXKCR5™PD1" T,,, cells (Fig. 1d, e). 
Collectively, these results suggest that Ascl2 is highly expressed in Try 
cells and that its expression may precede that of Bcl6. 

Bcl6 and Batf are necessary in Tyy;-cell development*””, whereas Stat5 
inhibits T},-cell development'*’’. Overexpression of Bcl6 or Batf, or 
Stat5 deficiency, failed to increase Ascl2 expression (Extended Data 
Fig. 1c). None of the known stimuli, including anti-CD3, anti-CD28, 
anti-ICOS, interleukin (IL)-6 and IL-21, nor their combination, upre- 
gulated Ascl2 expression in T cells (Extended Data Fig. 1d). Ascl2 was 
previously shown to bea target of canonical Wnt signalling in intestinal 
stem cells’, and we also found that Ascl2 and CXCR35 but not Bcl6 expres- 
sion in CD4* T cells could be upregulated by TWS119 (ref. 16; Fig. 1f 
and Extended Data Fig. 1d, e) or other Wnt agonists (data not shown). 

As a first step to examine the function of Ascl2 in Tpy cells, retro- 
viral overexpression of Ascl2 was conducted in CD4* T cells, leading 
to substantial induction of CXCR5 expression in over 30% of trans- 
duced cells, whereas overexpression of Bcl6, Batf or Maf in purified 
T cells did not (Fig. 2a and Extended Data Fig. 2a). Ascl2 overexpres- 
sion increased Cxcr5 mRNA expression by ~60 fold (Fig. 2b), without 
affecting Bcl6, Prdm1, Batf, Sh2dla, Cd40lg, Icos, Pdcd1, Btla and 
1121 expression (Fig. 2c). CXCR5 expression was equally induced by 
Ascl2 in wild-type, Bclo ‘~ and Batf ‘~ CD4* T cells in vitro (Fig. 2d). 
Thus, our findings suggest that Ascl2 is unique in its ability to induce 
CXCRS protein expression in CD4* T cells in vitro. CCR7 and PSGL1 
(Selplg) as well as CD25 (I/2ra) and CD122 (I/2rb) expression were down- 
regulated in Ascl2-overexpressing T cells (Fig. 2e and Extended Data 
Fig. 2b), probably accounting for increased follicular homing ability 
and decreased IL-2 signalling in Tpy; cells'*"*'”. In addition, T};1- and 
Ty17-related signature genes were strongly suppressed by Ascl2 (Extended 
Data Fig. 2c). 

We then assayed the role of Ascl2 in vivo by transferring Ascl2- 
transduced OT-II cells. In Terb~/~ recipient mice, at day 2 after immuni- 
zation with 4-hydroxy-3-nitrophenyl (NP)- ovalbumin (OVA) in CFA, 
neither CXCR5 nor Bcl6 expression were detectable in the vector- 
transduced control group, whereas Ascl2 overexpression strongly increased 
in CXCR5‘ Bcl6" cells (Fig. 2f, g). By contrast, ectopic expression of 
Bcl6 did not promote T-cell generation at this time point (Extended 
Data Fig. 2d, e). At day 6 after immunization, Ascl2 overexpression 
induced a higher percentage of CXKCR5"Bcl6™ Tpy cells (Fig. 2f, g). 
Accordingly, germinal centre B cells and the total area of the germinal 
centre at day 8 in C57BL/6 mice receiving Ascl2-transdued T cells were 
significantly increased (Fig. 2h-j). Anti- NP immunoglobulin (Ig)M, 
IgA, IgG1 as well as IgG3 titres were increased in Tcrb ‘~ mice, whereas 


1Tsinghua University School of Medicine, Beijing 100084, China. Department of Immunology, MD Anderson Cancer Center, Houston, Texas 77054, USA. °Department of Lymphoma and Myeloma, MD 
Anderson Cancer Center, Houston, Texas 77054, USA. “Institute for Systems Biology, Seattle, Washington 98103, USA. °Laboratory of Immunology, National Eye Institute, NIH, Bethesda, Maryland 20892- 
1858, USA. °Hubrecht Institute-KNAW and University Medical Center Utrecht, Uppsalalaan 8, 3584 CT Utrecht, the Netherlands. 7National Institute of Dental and Craniofacial Research, NIH, Bethesda, 
Maryland 20892-2190, USA. ®State Key Laboratory of Ophthalmology, Sun Yat-sen University, Guangzhou 510275, China. +Present addresses: College of Life Sciences, Wuhan University, Wuhan 430072, 


China (B.Z.); SomantiX B.V., Padualaan 8, 3584 CH Utrecht, the Netherlands (L.G.v.d.F.). 


27 MARCH 2014 | VOL 507 | NATURE | 513 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


a = CD4*CD44*Tcell Bb _ c 
ue} = 
108 i} 3 
137 15.3 =e te & 207,, 
g 15 : S10 
i 10 z 
f ae — 5 
2 x 
ad 2s 6.O7 660 © OY oe ee ee 
Bcl6-RFP 2 OMY OOM OO 0.0 6 & cae eee 
cc See EK oO co she Ons Ce FE 2 PRET 
Naive CXCR5- CXCR5* CXCRS" a 
é Guiman tonsil as Bclé- Bcl6® ~—-Bcl6" 
‘ cS) 
Cp4" T cell 2 40 Control | TWS119 (1 uM) 
10° 3 oe 104 0 
4 2 3 31 
210 a =~ 20 ee 9 10 o 
Qigs lisa! ra : © 102 2.08 102 43.4 
6) = 10 x 
107 1° FN Q 101 ot 
O41 ti ic 0 Be 0°. ae 
pre oa © 
010? 10° 10% 10° rc oS LSP Ss Shp PSs ep 1009 10! 102 10° 104 10° 10° 102 10° 108 
PDI CnC Ols Bol ———> 
CXCR5- CXCR5* CXCRS" 
PD1- PD1® = PDA 


Figure 1 | Ascl2 is selectively expressed i in both mouse and human Ty, cells. 
a, Three populations of CKCR5 1B c16-REP* (red), CKCR5* Bcl6-REP"® (blue) 
and CXCR5 Bcl6-RFP (black) cells were sorted from dLNs in Bcl6-RFP mice 
immunized with KLH emulsified in CFA subcutaneously. b, Ascl2, CKCR5 
and Bcl6 transcriptional expression in sorted cells. c, Ascl2 mRNA expression 
among in-vivo-generated Ty, naive, T}0, Ty1, Ty2 and Ty17 cells by 
quantitative polymerase chain reaction with reverse transcription (RT-PCR). 
d, Flow cytometric analysis of human tonsil CD4* T cells by CXCR5 and PD1 


IgG2a and IgG2b were not affected by Ascl2-overexpressing T cells 
(Fig. 2k), consistent with IgG2a switching being primarily mediated by 
extrafollicular T cells’*. 

We next assessed whether Ascl2 could promote T-cell follicular hom- 
ing in vivo. Ascl2-overexpressing OT-II cells preferentially accumulated 
in follicles (Fig. 21), and even in the germinal centre (Fig. 2)), in com- 
parison with control vector-infected T cells. Given that Bcl6 overex- 
pression does not affect early Tyy4-cell generation (Extended Data Fig. 2d, e) 
and T-cell migration’, these observations collectively demonstrate 
that, by contrast with Bcl6, Ascl2 promotes T-cell migration to the 
follicles and the initiation of Tpy-cell development. 

To investigate the mechanism of Ascl2-controlled Ty;;-cell genera- 
tion, we performed microarray analysis, and found that the expression 
of 293 genes was changed by more than twofold by Ascl2 overexpres- 
sion. Cross-referencing the current data set of Ascl2 versus vector with 
our previous Tpy;-cell versus non-T p,-cell data set* revealed that 85 of 
the 293 genes affected by Ascl2 were directly associated with Tyy-cell 
differentiation: 22 genes were upregulated and 63 genes were down- 
regulated* (Fig. 3a, b and Supplementary Table 1). The chemokine 
receptors CXKCR5 and CXCR4, which are germinal centre Tpy-related 
receptors’”, were at the top of the upregulated gene list, whereas T};1- 
related genes (I/12rb1, Tbx21, Ifng and Gzmb) and the T);17-related 
aryl hydrocarbon receptor (Ahr) gene were greatly suppressed by Ascl2 
(Fig. 3a). When comparing Ascl2-RV-GFP-infected (GFP, green fluor- 
escent protein; RV, retroviral vector) CKCR5~ T cells and CXCR5_ T 
cells with in-vivo-generated Tz and non-Tyy cells*, we found that 
Ascl2-induced CXCR5™ T cells were more similar in gene expression 
to Try cells (Extended Data Fig. 2f, g), with ~350 genes commonly 
expressed in these cells (Extended Data Fig. 2h). 

We further examined the effect of Ascl2 on Ty1-, Ty2- and Ty17-cell 
differentiation. As shown in Extended Data Fig. 3a, overexpression of 
Ascl2 suppressed both T}y1 and Ty17 differentiation and induced 
CXCRS expression. Ascl2 had no effect on TGF-f-induced Foxp3 expres- 
sion but induced CXCR5* regulatory T (Treg) cells, suggesting that it 
may also be related to follicular T regulatory (Tp)-cell generation”. 
Under T}42-polarized conditions, Ascl2 enhanced IL-4 expression, whereas 
it inhibited the expression of Gata3, IL-5 and IL-13 (Extended Data 
Fig. 3a—c), in agreement with recent studies that showed IL-4 but not 
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staining. e, The expression of Ascl2, Cxcr5, Bcl6 and Gata3 mRNA in sorted 
cells. f, CKCR5 expression in CD4° T cells activated by anti-CD3/anti-CD28 in 
the presence of TWS119 (1 1M) (an inhibitor of glycogen synthase kinase 

3B (GSK-3B)) for 3 days. All experiments were repeated at least three times 
with similar results. b, c, e, Bar graphs show the relative level of mRNA as 
mean + standard deviation (s.d.), n = 3 per group. *P < 0.05, **P< 0.01, 
two-tailed t-test. 


Gata3, IL-5 or IL-13 expression in Tpy cells*’. Also, we observed that 
Ascl2 increased IL-4 but not IL-21 production in vivo (Extended Data 
Fig. 3d, e). Therefore, Ascl2 promotes Tpy gene expression and inhibits 
Tyl-, Ty2- and Ty17-related gene expression. 

We next assessed Ascl2 target genes by chromatin immunoprecipi- 
tation coupled with high-throughput sequencing (ChIP-seq). The ana- 
lysis revealed a total of 10,028 Ascl2-binding peaks, among which 41% 
and 36% were enriched in intronic and intergenic regions, respectively 
(Fig. 3c). Only 20% of Ascl2-binding sites were located at promoter 
regions (Fig. 3c). Further comparison of global Ascl2-binding sites with 
the Ascl2-regulated gene list showed that 145 among 4,374 Ascl2- 
bound genes were transcriptionally regulated by Ascl2 (Fig. 3d). 

As anticipated, analysis of the Ascl2-binding peaks identified an 
E-box protein-binding site (5'-CANNTG-3’) as the consensus motif? 
(Fig. 3e). Ascl2-binding sites were identified in groups of gene loci 
including receptor genes (Cxcr5, Cxcr4, Ccr7, Selplg, [l2ra and Il2rb) 
and inflammatory signature genes (Ifng, Tbx21, Il2 and Rorc) (Fig. 3f), 
but not in some Ty;,-related genes (Bcl6, Prdm1, Pdcd1, Sh2d1a, Icos, 
1121 and Cd40lg) (data not shown). 

In particular, the Cxcr5 locus was found to have multiple Ascl2-binding 
sites in the conserved non-coding sequence (CNS) regions (Fig. 3f, g). 
Moreover, these Ascl2-binding sites at the Cxcr5 locus were confirmed 
in in-vivo-generated Ty cells: two strong binding sites at intronic regions 
(CNSS5 and CNS4-3), and three at distal promoter region (CNS1-1, CNS1 
and CNS2) (Fig. 3f-h). Of note, the strongest Ascl2-binding peak was in 
the CNS5 region, which is consistent with the E47- (bHLH family mem- 
ber) binding site at the Cxcr5 locus” (Fig. 3f-h), implying a potentially 
redundant role of E2A in transcriptional regulation of the Cxcr5 gene”. 

To examine the functional significance of Ascl2 binding in regulat- 
ing CXCRS expression, we introduced 1d3, an inhibitor of the E-box 
protein”, into Ascl2-overexpressing OT-II cells by retroviral infection, 
and observed a substantial reduction in Ascl2-regulated CXCR5 expres- 
sion (Extended Data Fig. 4a); this reduction was due to the inhibition of 
Ascl2 binding at the Cxcr5 locus, as revealed by ChIP assay (Extended 
Data Fig. 4b). Additionally, a luciferase reporter assay showed that CNS5 
and CNS] were responsive to Ascl2 binding (Extended Data Fig. 4c, d). 
Together, these results provide evidence that Ascl2 is involved in the 
direct control of Tyy-cell programming. 
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Figure 2 | Ascl2 expression induces the Try 
program. a, Flow cytometry analysis of surface 
CXCRS expression from empty-RV-GFP, 
Ascl2-RV-GFP and Bcl6-RV-GFP retrovirus- 
infected T cells, respectively. b, Sorted GFP™ T cells 
were subjected to the measurement of Cxcr5 
mRNA by quantitative RT-PCR. c, Measurement 
of gene expression, including Cxcr5, Bcl6, Prdm1, 
Batf, Sh2d1a, Cd40lg, Icos, Pdcd1, Btla and 1121. 

d, Surface CXCRS5 expression in Ascl2-RV-GFP- 
and vector-virus-infected wild-type (WT), Belo /— 
and Batf ‘~ T cells. e, CCR7, PSGL1, CD25 and 
CD122 expression by flow cytometry analysis. 
Dashed line, isotype; solid line, Ascl2-RV-GEFP; 
shaded area, empty-RV-GFP. f-i, k, Ascl2-RV- 
GFP- or empty-RV-GFP-transduced GFP OT-II 
cells were transferred into naive Tcrb ’~ mice 
subsequently immunized with NP-OVA/CFA. 

f, At day 2 and day 6, flow cytometry analysis of 
donor cells was undertaken with staining for 
CXCRS and Bcl6 (n = 4). g, Quantification of 
CXCR5* and CXCR5"Bcl6* donor-derived T 
cells. h, Germinal centre B cells (GL7™ BAS") in 
recipient mice (n = 4). i, Quantification of 
germinal centre (GC) B cells. j, Ascl2-RV-GFP- or 
empty-RV-GFP-transduced GFP* OT-II were 
transferred into congenic mice. Four days later, 
each mouse was immunized subcutaneously with 
OVA (30 11g)/Alum/LPS. At day 8, dLNs were 
collected and subjected to histochemistry analysis. 
Green, GFP; red, Bcl6; blue, anti-IgD; scale bar, 
100 ppm, dot graph represents GC areas per slide 
view (some slides in Ascl2 group contain > 1 GC), 
displayed as mean + s.d., control group (n = 5), 
Ascl2-RV-GFP group (n = 12). k, Titres of 
NP-specific antibodies in serum from mice on day 
8 after immunization (n = 4). 1, Distribution of 
Ascl2-RV-GEP- and vector-infected GFP* OT-II 
donor cells in IgD* B-cell follicles from dLNs in 
mice immunized with OVA/Alum/LPS for 4 days. 
Scale bar, 100 j1m. Dot graph represents 
distribution with the ratio of donor cells in the 
B-cell follicle (fol.) versus the T zone, displayed as 
mean + s.d. Empty-RV-GFP, n = 21; Ascl2-RV- 
GFP, n = 15. All experiments were repeated at least 
two times with similar results. b, c, g, i, j-1, Graphs 
show data as mean = s.d., two-tailed t-test. 

NS, not significant. 


| 515 


LETTER 


a b Tey VS Ascl2 vs 
a non-T, vector 
= a 
[o} 
me 
= g 
& z 
Q a 
Q a 
z= <x 
i 
Non-T,,, (CXCRS~ Bcl6é-) Empty-RV-GFP 
Gene Tey VS Ascl2 vs Gene ¢ 
symbol non-T, vector description 
Cxerd 11 6 Chemochine (C-X-C motif) receptor 5 ¥ é 
Cxcr4 5 S Chemokine (C-X-C motif) receptor 4 =H thal (20%) 
Ahr -2 -3 Aryl-hydrocarbon receptor Giron as 
+ 0. 
I12rb1 -4 -3 Interleukin 12 receptor, B1 alntergenic (36%) 
M2 =5 -2 Interleukin 2 
Tbx21 -14 -4 T-box 21 
lfng -18 3 Interferon-y 
Gzmb -34 -5 Granzyme B 
d Expression Binding e 
E-box: 
5’-CANNTG-3’ 
Cxer5 < Cxcr4 <— 
f 60a i C= 
. Vt Upregulated 
0 0 genes 
Conservation SS PP TYPE aT TTT AVERT eT (0 eT TET 
Ccr7_ < Selplg < l2ra_ —> l2rb << 
Cr == 
2.5 | 44 a 0.6 3.2 | 
ol ile wu | | Lame shu 0 I 0 cb 
mente tb hme ahipurntihradly « a tipo 1+: Mipshtbeiin ht ea anaola] Aantal sidan + madam wala rnin ee ee ee Downregulated 
ling —> Tbx21 < I2 + Rorc —> genes 
He i] 
2.3 | 0.7 | 3.2 | 3.9 | 
0 0 | 0 | 0 
SS PS UP SVN T PNY a MT Do a Al] ae ip as fe ae IA I Tl] 
h 
= 25, CNSS & Anti-Ascl2 
2 -® IgG control 
£ 20 
iS] 
5 iS CNS2 
oO it 
g 10 ee CNS1 CNS1-1 
all | 
@ oli 0 pe-Ca-e-oes f wie 0-9 


Figure 3 | Ascl2-dependent transcriptional regulation of Tpy-related genes. 
a, Scatterplot of the average signal of Tpy versus non-Tpy cells, and Ascl2-RV- 
GFP* versus empty-RV-GFP~ T-cell gene expression microarray data. The 
green line indicates gene expression change by a factor of 2; data normalized 
from two replicates are shown; n = 2. Genes with the most transcriptional 
changes are listed. b, Venn diagram of genes regulated by Ascl2 and Ty;-related 
genes. c, Distribution of Ascl2 ChIP-seq peaks in Ascl2-overexpressed CD4* T 


As Maf"®, Batf"!”? and IRF4 (ref. 13) are required in Tpy-cell differ- 
entiation, we compared genome-wide occupancy of these transcrip- 
tional factors, and found that Ascl2-bound genes hardly correlated 
with those bound by Maf (Extended Data Fig. 5a). For instance, IL-21 
is directly regulated by Maf™, but not by Ascl2 (Extended Data Fig. 5b). 
Additionally, there was no binding site for Maf at the Ascl2 locus (data 
not shown), or vice versa (Extended Data Fig. 5b), suggesting that 
Ascl2 and Maf are functionally independent in Tpy cells. A large pro- 
portion of Ascl2 occupancy colocalized with Batf/IRF4-binding sites 
(Extended Data Fig. 5c), including at the CNS of gene loci including 
Cxcr5, Cxcr4, Cer7, Selplg, 112, [l2ra and [l2rb (Extended Data Fig. 5d), 
but not at Bcl6, Prdm1 and Maf (Extended Data Fig. 5b). Interestingly, 
Batf/IRF4 and Ascl2 peaks were found to be independently localized in 
Tyl, Ty2 and Ty17 signature genes (Extended Data Fig. 5e-g), indi- 
cating that Batf/IRF4 acts downstream of the T-cell antigen receptor 
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cells. d, Venn diagram of genes regulated by Ascl2 and Ascl2-bound genes. 

e, The Ascl2-binding site is identical to the E-box-binding motif. f, Ascl2- 
binding peaks located at gene loci including Cxcr5, Cxcr4, Ccr7, Selplg1, Il2ra, 
Il2rb, Ifng, Tbx21, 112 and Rorc. g, Comparison of the mouse Cxcr5 genomic 
sequence with that of dog, human and rat. Red regions denote CNS region. 
h, Ascl2-binding sites at the Cxcr5 locus in in-vivo-generated Ty; cells as 
determined by ChIP-qPCR (primers are listed in Supplementary Table 2). 


(TCR) in activation of effector programs”, whereas Ascl2 mediates 
suppression. These data strongly support Ascl2 acting as a specific 
regulator in Tpy cells. 

Try cells provide important help to B cells in the induction of 
efficient anti-virus antibodies during viral infection®”*. To address 
the functional roles of Ascl2, we generated Ascl2“"/CD4-Cre mice, 
in which T cells were developmentally intact (data not shown), and 
assessed the requirement of Ascl2 in Tpy-cell development in vivo with 
influenza virus infection. After intranasal infection, Ascl2“"/CD4-Cre 
mice developed gradual body weight loss from day 3 to day 9, whereas 
control littermate mice recovered after day 8 (Extended Data Fig. 6a). 
At day 9 post-infection (d.p.i.), viral haemagglutinin (HA) mRNA 
expression in the lungs of Ascl2“"/CD4-Cre mice was over fivefold 
higher than that in control mice (Extended Data Fig. 6b), whereas 
CXCRS5*Bcl6* Ty cells in lung draining lymph nodes (dLNs) had 
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decreased by over 50%, there was a ~3-4-fold decrease in total cell 
numbers in the absence of Ascl2 (Fig. 4a), probably accounting for a 
severe reduction in germinal centre B-cell formation and anti-viral IgG 
production (Extended Data Fig. 6c, d). Also, Tpy-cell and germinal 
centre B-cell development in the spleen was reduced in Ascl2“"/CD4- 
Cre mice (Extended Data Fig. 6e, f). No difference was found between 
CD8* T-cell populations in the lung, bronchoalveolar lavage fluid 
(BALF), spleen and dLNs in Ascl2”"/CD4-Cre mice and control mice 
(Extended Data Fig. 6g), although CD8* T-cell activation in the dLNs 
of Ascl2“"/CD4-Cre mice was slightly enhanced (Extended Data Fig. 6h). 
Moreover, we applied the same assay to tamoxifen-treated mixed bone 
marrow chimaeric mice receiving both Ascl2“"/ERT2-Cre and Ascl2*/*/ 
ERT2-Cre bone marrow cells, and found a ~50% decrease in T py; cells 
in the Ascl2“/ERT2-Cre counterpart, whereas no defect was found in 
Ascl2-ablated CD8* T cells (Extended Data Fig. 6i, j). 

Because Ascl2 can form heterodimers with three other bHLH family 
members, including E2-2 (also known as TCF4), E47 (also known as 
TCF3) and HEB (also known as TCF12) in human cells*”’, this sug- 
gests that the partial defect in Tpy,-cell differentiation in Ascl2“"/CD4- 
Cre mice might be caused by compensation from other bHLH members. 
Indeed, Ty; cells and germinal centre responses were normal in Ascla 
CD4-Cre mice after immunization with KLH in CFA (data not shown). 
A substantial enhancement of E47 expression was noticed in naive T, 
Try and even non-Tpy cells from Ascl2“"/CD4-Cre mice compared 
with control mice (Extended Data Fig. 7a), and expression of E47 also 


increased CXCRS5 expression in CD4° T cells (Extended Data Fig. 7b), 
consistent with a recent report that increased E47 activity in Id3-/~ 
mice is associated with increased Tpy-like cells”. Furthermore, we 
examined T-cell maturation and Tpy-cell differentiation in chimaeric 
mice receiving both Ascl2“"/CD4-Cre and Ascl2‘’* /CD4-Cre, or only 
Ascl2“"/CD4-Cre bone marrow cells. As shown in Extended Data 
Fig. 7c, d, in chimaeric mice, Ascl2 deficiency reduced TCR-B"CD69"° 
mature T cells, whereas mice receiving only Ascl2“"/CD4-Cre cells had 
fewer defects in T-cell maturation. Accordingly, after immunization with 
KLH/CFA, Tyy-cell generation was inhibited in the Ascl2“"/CD4-Cre 
counterpart from mixed chimaeric mice (Extended Data Fig. 7e-g). 
These data suggest that Ascl2 deletion at an early developmental stage 
induces compensatory mechanisms to allow T-cell maturation. 

To overcome the inducible compensation mechanism, we deleted 
the Ascl2 gene acutely using a Cre-expressing retrovirus and examined 
Try-cell differentiation. As shown in Fig. 4b, Ascl2 deficiency in this 
case resulted in an absolute impairment in Tp; development in vivo. 
By contrast, Bcl6-deficient OT-II cells showed intact CKCR5 express- 
ion and homing ability at day 3 after immunization (Extended Data 
Fig. 8a—e). Moreover, Id3 overexpression impaired Tpy1-cell generation 
and germinal centre responses (Fig. 4c and Extended Data Fig. 9a-c). 
Conversely, Id3 deficiency enhanced the Ty-cell population (Fig. 4d). 
Together with the observation that Bcl6 overexpression could not res- 
cue Id3-induced Ty blockage (Extended Data Fig. 9d), Ascl2 seems to 
have an earlier function than Bcl6 in Tpy-cell development. 
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To confirm this, we examined T;y;-cell differentiation in either mixed 
chimaeric mice receiving equal numbers of Ascl2“"/ETR2-Cre and 
Ascl2*/* /ETR2-Cre bone marrow cells (Fig. 4e), or intact Ascla’“ry 
ETR2-Cre mice (data not shown). After tamoxifen treatment and KLH/ 
CFA immunization, Ascl2 deletion inhibited T},;-cell differentiation 
and germinal centre response in vivo (Fig. 4e and data not shown). 
Therefore, these data verify that Ascl2 is intrinsically necessary for 
Tpy-cell differentiation. 

We have identified a new player—the Ascl2 transcription factor— 
that is crucial for Tyy;-cell development and function. On the one hand, 
similar to Bcl6, Ascl2 acts as a novel suppressor of Ty1-, Ty2- and 
Ty17-cell differentiation. On the other hand, Ascl2 uniquely regulates 
Trx-cell migration and development by increasing CKCR5 and CXCR4 
expression, and suppressing CCR7 and PSGLI expression, and IL-2 
signalling. Our data indicate that Ascl2 and Bcl6 act to program TyyH- 
cell generation. Activated T cells by antigen-presenting cells gain Ascl2 
expression, which allows their migration towards B-cell follicles. At the 
T-B border, cognate B cells provide another signal for precursor Ty 
cells to increase Bcl6 expression, which completes Ty polarization and 
germinal centre formation. Therefore, Id3 and Ascl2 may serve as an 
early checkpoint during T-cell development in promoting appropri- 
ate antibody responses to infection while keeping autoimmune dis- 
eases in check (Extended Data Fig. 10). Further investigation into this 
axis may offer new ways to modulate antibody responses in infection 
and autoimmunity. 


METHODS SUMMARY 


Mice, immunization and infection. Ascl2”" mice were generated previously° and 
had been backcrossed with C57BL/6 for at least six generations. Mice were immu- 
nized with antigens emulsified in CFA subcutaneously (100 pl each mouse). Influenza 
virus A/Puerto Rico/8 (PR8, H1N1) was purchased from Charles River Laboratories. 
Mice were anaesthetized by intraperitoneal injection with ketamine and infected 
intranasally with a dose of PR8 influenza virus” lethal to 50% of animals tested 
(LDs0) of 0.5. 

ChIP-seq. ChIP-seq was performed as described previously~*. The DNA fragments 
were sequenced with an Illumina 1G Analyzer at the Institute for System Biology. 
The output of the Solexa Analysis Pipeline was converted to browser-extensible 
data (BED) files, and the data were viewed in the UCSC genome browser. 
Statistics. Unless specifically indicated, comparison between different groups was 
conducted with Student’s t-tests. All P values below 0.05 were considered signifi- 
cant. Statistical analysis was performed with Graphpad Prism 6. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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In cancer patients, visual identification of sentinel lymph nodes 
(LNs) is achieved by the injection of dyes that bind avidly to endo- 
genous albumin, targeting these compounds to LNs, where they are 
efficiently filtered by resident phagocytes’. Here we translate this 
‘albumin hitchhiking’ approach to molecular vaccines, through the 
synthesis of amphiphiles (amph-vaccines) comprising an antigen 
or adjuvant cargo linked to a lipophilic albumin-binding tail by a 
solubility-promoting polar polymer chain. Administration of struc- 
turally optimized CpG-DNA/peptide amph-vaccines in mice resulted 
in marked increases in LN accumulation and decreased systemic 
dissemination relative to their parent compounds, leading to 30- 
fold increases in T-cell priming and enhanced anti-tumour efficacy 
while greatly reducing systemic toxicity. Amph-vaccines provide a 
simple, broadly applicable strategy to simultaneously increase the 
potency and safety of subunit vaccines. 

A major challenge in the development of subunit vaccines is the 
efficient delivery of antigen/adjuvant to secondary lymphoid organs, 
where immune responses are orchestrated**. Attempts to enhance 
vaccine delivery have included the use of depot-forming adjuvants’ or 
nanoparticulate carriers that are preferentially internalized by antigen 
presenting cells (APCs)**’, but approaches that could use well-defined 
molecular conjugates would be attractive. Antigens conjugated to 
antibodies targeting dendritic cells reach these cells in the draining 
LNs but also drain into the systemic circulation and access APCs in 
distal tissues'**, which might promote tolerance unless inflammatory 
adjuvants are systemically co-administered. LN targeting is also required 
for cancer staging in sentinel LN mapping procedures, in which radio- 
active or coloured dyes are injected at tumour resection sites’. Compounds 
that bind avidly to serum albumin are particularly effective LN tracers’; 
albumin binding targets these molecules to lymphatics and draining 
LNs, where they accumulate in APCs'*’’. Inspired by this strategy, we 
set out to create LN-targeting molecular vaccines designed to similarly 
‘hitchhike’ on albumin to LNs. Exploiting albumin’s role as a fatty 
acid transporter, we proposed that antigens/adjuvants modified with a 
lipophilic albumin-binding domain would accumulate in lymphoid 
organs after injection, through in situ complexation and transport 
with endogenous albumin. To develop this strategy, we studied model 
vaccines composed of peptide antigens and CpG DNAs, single-stranded. 
oligonucleotides containing unmethylated cytosine-guanine motifs 
that bind Toll-like receptor 9 (TLR9) and serve as potent molecular 
adjuvants’”"*. 

To identify an optimal albumin-binding domain that could be appended 
to either CpG or peptide antigens, we constructed a series of amphi- 
philic 20-base phosphorothioate (PS)-stabilized CpG oligonucleotides 
5'-linked to lipophilic tails (amph-CpGs; Fig. 1a; for synthesis details 
see Supplementary Information). We first evaluated the interaction of 
fluorescein amidite (FAM)-labelled conjugates with serum proteins 
by size-exclusion chromatography (SEC; Fig. 1b). Fetal bovine serum 


(FBS) exhibited a major protein fraction eluting at 5.3 min in SEC that 
coincided with albumin (Extended Data Fig. 1a). The vast majority of 
mono-acyl-conjugated (C18-CpG) or cholesterol-conjugated (Cho-CpG) 
oligonucleotides eluted as monomers at 5.8 min in the presence or 
absence of serum, indicating a lack of interaction with albumin (Fig. 1b 
and Extended Data Fig. 1a). By contrast, diacyl lipid-conjugated CpGs 
(lipo-CpGs) in aqueous solution eluted as micelles (3.7 min), but, after 
incubation with serum, nearly 50% of the lipo-CpGs co-migrated with 
albumin (Fig. 1b). Biotinylated lipo-CpG (but not CpG) incubated 
with FBS and then captured with magnetic beads was found to pull 
down albumin, and lipo-CpG was efficiently captured by albumin- 
conjugated agarose (Extended Data Fig. 1b, c). Biolayer interferometry 
and spectroscopy measurements of fluorescence resonance energy trans- 
fer (FRET) between lipo-CpG and purified albumin further confirmed 
their molecular association in solution (Extended Data Fig. 1d, e). 

We next characterized the in vivo trafficking of CpG conjugates. 
Amph-CpGs were injected subcutaneously in C57BL/6 mice, and 24h 
later, draining LNs were excised intact for in vivo imaging system 
(IVIS) fluorescence imaging. C18- and Cho-CpG showed marginally 
increased uptake in LNs relative to unmodified CpG, reaching levels 
similar to CpG delivered by two prototypical vaccine vehicles known 
to enhance vaccine accumulation in LNs, incomplete Freund’s adjuv- 
ant (IFA) or poly(ethylene glycol) (PEG)-coated liposomes® (Fig. 1c). 
By contrast, lipo-CpG accumulated eightfold more than soluble CpG 
(Fig. 1c). Over 7 days after injection, soluble CpG exhibited no LN 
accumulation above 0.3% of the injected dose at any time, whereas 
lipo-CpG accumulated for 3 days, giving a 12-fold greater area under 
the curve (AUC) for total LN exposure to CpG (Fig. 1d). LN accu- 
mulation was not dependent on TLR9-recognized CpG motifs, and 
was not due to increased nuclease resistance of lipid-modified PS- 
backbone CpGs (data not shown). 

Our in vitro analysis indicated that lipo-CpG molecules equilibrated 
between micellar and albumin-bound forms in the presence of serum, 
making it unclear which state was responsible for LN targeting. However, 
injection of lipo-CpG pre-incubated for 5 h with freshly isolated 90 vol% 
mouse serum at 37 °C, conditions in which SEC showed the vast majo- 
rity of the amphiphile co-migrated with albumin, led to essentially 
identical LN targeting compared with direct lipo-CpG injections (data 
not shown). To address this question further, we introduced poly- 
guanine repeats between the diacyl lipid and CpG sequence to lock 
the amphiphiles in the micellar state and block disassembly by albumin. 
G-quadruplex hydrogen bonding between adjacent oligonucleotide 
strands in lipo-G,,-CpG micelles containing four or more guanine repeats 
rendered the micelles stable in the presence of serum (Extended Data 
Fig. 2a—d). Labelled albumin did not co-migrate with stabilized micelles, 
as assessed by SEC (Extended Data Fig. 2e), suggesting that the amphi- 
philes do not interact with albumin as intact micelles. Despite forming 
micelles with similar sizes (Extended Data Fig. 2f), G-quartet-stabilized 
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Figure 1 | Design of an LN-targeted molecular 
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lipo-G,4-CpG or lipo-Gg-CpG micelles exhibited poor LN accumula- 
tion after injection compared with albumin-binding lipo-CpG and 
lipo-G-CpG (Fig. 1d, e). A possible explanation is that amplification 
of nonspecific matrix binding by the PS-DNA backbones’””® in the 
multivalent micellar form irreversibly trapped the stabilized micelles at 
the injection site. Histological sections of draining LNs showed little 
detectable CpG or lipo-G,-CpG, whereas albumin-binding lipo-CpG 
and lipo-G,-CpG accumulated in the subcapsular sinus and interfolli- 
cular areas (Fig. 1f). LN-accumulating amphiphiles were associated 
with F4/80* macrophages and CD11c* dendritic cells, with only a 
minor contribution from skin-derived CD207* dendritic cells (Fig. 1g 
and Extended Data Fig. 3). If albumin hitchhiking mediates LN tar- 
geting, then covalent conjugation of oligonucleotides to albumin should 
also enhance LN accumulation. We found that injection of CpG con- 
jugated to mouse serum albumin (MSA) led to slightly lower oligonu- 
cleotide uptake in LNs than lipo-CpG, but much greater accumulation 
than soluble CpG (Extended Data Fig. 4). Altogether, these data sug- 
gest that efficient LN delivery of CpG oligonucleotides conjugated to 
lipophilic tails is enhanced by partitioning of oligonucleotides from 
micelles into a serum albumin-bound state. 

To identify potential differences in the function of CpG versus 
optimally LN-targeted lipo-G,-CpG beyond altered biodistribution, 
we assessed several aspects of CpG bioactivity in vitro: lipo-G)-CpG 
was internalized by dendritic cells into endolysosomes in a pattern 
indistinguishable from CpG in confocal microscopy, albeit to twofold 
higher levels (Extended Data Fig. 5a, b). Lipo-G»-CpG did not activate 
the lipid-binding receptor TLR2 in reporter cells, but both free CpG and 
amph-CpG activated RAW macrophages bearing a NF-kB reporter in 
a CpG sequence-specific manner (Extended Data Fig. 5c, d). When 
dendritic cells were activated with CpG or lipo-G,-CpG and pulsed 
with OVA protein to test cross-presentation of antigen to OT-I (OVA- 
specific) T-cells, dendritic cell activation by soluble or amphiphile- 
CpG led to similar T-cell proliferation (Extended Data Fig. 5e). Thus, 
lipid modification of CpG increased uptake in the presence of serum 
but did not otherwise greatly alter the bioactivity of CpG. 
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To determine the impact of LN targeting on the immune response, 
we immunized mice with ovalbumin protein (OVA) mixed with unmodi- 
fied CpG, Cho-CpG, CpG emulsified in IFA or lipo-G,,-CpGs (n = 0, 
2, 4, 6). OVA (which has only 13% sequence identity with albumin) 
showed minimal association with lipo-CpGs (data not shown) and thus 
these vaccinations assessed the impact of LN targeting of the adjuvant 
only, relying on normal lymphatic drainage of the OVA antigen. Lipo- 
G,-CpG-adjuvanted vaccines primed significantly increased frequen- 
cies of antigen-specific, cytokine-producing CD8* T cells compared 
with unmodified CpG, Cho-CpG or CpG in IFA, but the strongest 
responses (up to 32-fold greater than unmodified CpG) were elicited 
by lipo-CpG and lipo-G,-CpG (Fig. 2a, b and Extended Data Fig. 6a, b). 
Notably, the magnitude of the T-cell response was strongly correlated 
with LN accumulation of CpG (Fig. 2c). Lipo-CpG also modestly increased 
antibody responses by approximately threefold and enhanced CD8* 
T-cell responses to the model HIV antigen simian immunodeficiency 
virus (SIV) Gag (Extended Data Fig. 6c, d). Importantly, no antibodies 
against albumin were detected for any of the amph-CpG vaccines 
(Extended Data Fig. 6e, fand data not shown). Control immunizations 
with non-TLR agonist lipo-GpC or diacyl-PEG conjugates (lipo-PEG) 
mixed with OVA were ineffective and amph-CpG responses were identical 
in Tlr2-‘~ mice (Extended Data Fig. 6g, h), ruling outa direct adjuvant 
effect of the diacyl lipid tail. CpG that is not captured in local LNs 
drains to the systemic circulation, leading to systemic inflammatory 
toxicities’. Despite lymphadenopathy of draining LNs, indicating 
local activity, repeated injections of lipo-CpGs showed greatly reduced 
systemic inflammation relative to free CpG (Fig. 2d, e and Extended 
Data Fig. 7a, b). Although further work will be needed to determine any 
potential autoimmune toxicities related to LN targeting of CpG", these 
results suggest that the LN targeting achieved by amph-CpGs greatly 
enhances the potency of this molecular adjuvant while simultaneously 
lowering acute systemic side effects. 

Synthesis of lipo-CpG is straightforward owing to the solubility 
imparted by the long polar oligonucleotide block, but, depending on the 
amino acid sequence, lipidated polypeptides can be essentially insoluble”. 
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Figure 2 | LN targeting enhances potency while reducing systemic toxicity 
of CpG. a-c, C57BL/6 mice (n = 4-8 per group) were immunized with OVA 
(10 pg) plus CpG (1.24 nmol) on day 0 and day 14; shown are SIINFEKL 
tetramer (a) and intracellular cytokine staining (b) on peripheral blood at 
day 20. c, LN CpG fluorescence correlation with T-cell response. d, Serum 
cytokines after injection (n = 3 per group) of 6.2 nmol CpG. e, Splenomegaly 
(n = 3 per group) assessed on day 6 after three injections of CpG. bw, body 
weight. Scale bar, 1 cm. ***P < 0.001, **P < 0.01, *P < 0.05 by one-way 
ANOVA with Bonferroni post-test. Data show mean = s.e.m. of 2-4 
independent experiments. NS, not significant. 


To generalize this LN targeting strategy to antigens and other potential 
vaccine components, we synthesized peptides linked to a diacyl lipid tail 
via a PEG block chosen to promote conjugate solubility (amph-peptides; 
Fig. 3a). Amph-peptides and lipo-PEGs in water form micelles, but 
these amphiphiles can also insert their diacyl tails into cell membranes. 
We found that lipo-PEG amphiphiles with short PEG blocks exhibited 
preferential plasma membrane insertion when incubated with cells in 
the presence of albumin in vitro (Fig. 3b and Extended Data Fig. 8a, b), 
which might limit transit to LNs on albumin in vivo. However, increas- 
ing the polar block to 48 ethylene glycol units yielded amphiphiles that 
partitioned preferentially into solution while retaining albumin bind- 
ing (Fig. 3b and Extended Data Fig. 8c), consistent with previous 
studies”. This in vitro partitioning directly predicted in vivo draining 
patterns, as lipo-PEG-FAM amphiphiles injected subcutaneously showed 
increasing LN accumulation with increasing PEG block length (Fig. 3c). 
Although optimal immunostimulatory CpGs are ~20 bases, an ana- 
logous trend was observed for DNA amphiphiles as a function of oli- 
gonucleotide length (Extended Data Fig. 8d, e). Like amph-CpGs, the 
structure of the hydrophobic block was also important; whereas lipo-PEG 
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amphiphiles with long diacyl tails (=16 carbons, which exhibit a high 
affinity for albumin) showed intense LN accumulation, shorter lipid 
tails with low affinity for albumin showed low LN accumulation (Fig. 3d). 
On the basis of these findings establishing design rules for efficient 
targeting of lipo-PEG amphiphiles to LNs, we conjugated peptide anti- 
gens to 1,2-distearoyl-sn-glycero-3-phosphoethanolamine-N-PEG (DSPE- 
PEG 2 kDa) to generate amph-peptides for vaccination studies (Extended 
Data Fig. 9a). 

To test the potency of combined antigen and adjuvant targeting to LNs, 
we prepared amph-peptide (DSPE-PEG-peptide) conjugates ofa model 
HIV antigen (AL11 epitope from SIV Gag)”*, the tumour-associated 
self-antigen Trp2 from melanoma”’, and a peptide derived from the 
human papillomavirus (HPV)-derived cervical cancer antigen E7 (ref. 27). 
Amph-peptides accumulated efficiently in LNs (Extended Data Fig. 9b, c). 
C57BL/6 mice immunized with amph-peptides and amph-CpG (lipo- 
G,-CpG) showed markedly increased expansion of antigen-specific, 
cytokine-producing CD8" T-cells and enhanced cytolytic activity relative 
to unmodified peptide/CpG immunizations (Fig. 4a—c). To test whether 
amph-vaccine delivery enhances the protective efficacy of peptide vac- 
cines, animals bearing established TC-1 tumours (expressing the E7 
oncoprotein from HPV) or B16F10 melanomas were vaccinated. Amph- 
vaccines triggered sustained regression of large TC-1 tumours that were 
only modestly affected by soluble vaccines (Fig. 4d) and slowed the 
growth of melanoma tumours, in which a traditional soluble vaccine 
had no effect (Fig. 4e). In addition to enhancing the effectiveness of 
optimal T-cell epitopes, synthetic ‘long peptide’ antigens”* also exhibited 
enhanced immunogenicity when delivered as amph-peptides (Extended 
Data Fig. 10a-e). This result is of particular interest because a finite pool 
of long sequences permits peptide vaccines to provide effective coverage 
of epitopes across the diverse haplotypes of a given target patient popu- 
lation. Amphiphile-long peptides were also approximately tenfold more 
potent than soluble peptides when combined with non-CpG, non-LN- 
targeted alternative TLR adjuvants (Extended Data Fig. 10f), showing that 
CpG is not required for an enhanced response to LN-targeted peptides. 

Altogether, the results presented here define design rules for amphi- 
phile conjugates as a general strategy to enhance the potency and safety 
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Figure 3 | Design of LN-targeted amph-peptides. a, Structure of amph- 
peptides. b, Amph-PEG-fluorescein insertion into cell membranes quantified 
by fluorescence spectroscopy after 1h incubation with splenocytes in the 
presence of 100 uM albumin. c, d, C57BL/6 mice (n = 4 LNs per group) were 
injected with fluorescent amph-PEGs with varying PEG lengths (fixed C18 
diacyl lipid tails; c) or lipid tail lengths (fixed PEG length 48 EG units; d); 
draining LNs were excised and imaged by IVIS after 24h. ***P < 0.001, 
**P< 0.01, *P < 0.05 by one-way ANOVA with Bonferroni post-test. Data 
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Figure 4 | Amph-vaccines maximize the immunogenicity and therapeutic 
efficacy of polypeptide vaccines. a—c, C57BL/6 mice (n = 3-4 per group) were 
immunized with SIV Gag, Trp2, or E7 peptides (10 1g) plus CpG (1.24 nmol) 
on day 0 and day 14; shown are tetramer-positive CD8* T-cells (a) and 
intracellular cytokine production (b) in peripheral blood on day 20. ¢, Trp2- 
specific cytotoxicity measured using an in vivo killing assay on day 21. 

d, e, Tumour growth in C57BL/6 mice (n = 8 per group) inoculated with 

3 X 10° TC-1 (d) or B16F10 (e) tumour cells and vaccinated with CpG plus E7 
peptide or Trp2 peptide (10 1g prime, 20 1g boost), respectively, on days 
indicated by arrows. Statistically significant differences between soluble and 
amph-vaccines are indicated by asterisks: ***P < 0.001, **P < 0.01, *P < 0.05 
by one-way ANOVA with Bonferroni post-test. Data show mean ~ s.e.m. of 
2-4 independent experiments. NS, not significant. 


of LN-active compounds, an approach that may be applicable to a broad 
range of immunomodulatory therapeutics and imaging agents. These 
findings also have implications for how the immune system may survey 
lipophilic antigens. Further work will be needed to determine whether 
albumin binding is functionally critical, or alternatively whether other 
rare serum components may have a role in the observed LN targeting. 


METHODS SUMMARY 


Synthesis of vaccine amphiphiles. Oligonucleotide amphiphiles were synthesized 
using an ABI 394 synthesizer on a 1.0 mol scale. All lipophilic phosphoramidites 
were conjugated as a final ‘base’ on the 5’ end of oligonucleotides”. Antigen amphi- 
philes were synthesized by reacting N-terminal cysteine-modified peptides with 
maleimide-PEG 999-DSPE in dimethyl formamide. 

Immunizations. Six-to-eight week old C57BL/6 mice (female, Jackson Laboratory) 
were immunized with 10 ug of antigen mixed with 1.24nmol CpG adjuvant in 
100 pl of PBS subcutaneously at the base of the tail. All procedures were performed 
under an IUCAC-approved animal protocol in accordance with the guidelines for 
animal care in a Massachusetts Institute of Technology animal facility inspected by 
the US Department of Agriculture. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Jump off the page 


Researchers are learning to embrace online lab notebooks, but not without growing pains. 


BY AMANDA MASCARELLI 


r | “wo years into his PhD, Carl Boettiger 
needed a better way to organize his data 
and synthesize his ideas. Fishing around 

online, he stumbled across chemist Cameron 

Neylon’s open electronic lab notebook. Boet- 

tiger, who was studying mathematical ecology, 

liked what he saw. Neylon, now advocacy 
director at the Public Library of Science in San 

Francisco, California, had pulled back the cur- 

tain on the steps and thought processes behind 

his protocols and research. His data collection, 
protocols and results were linked together and 
available online, making the concepts easy to 
reference and explore. 

Inspired, Boettiger created his own electronic 
notebook, reporting online about his day-to- 
day research in a publicly available wiki that 


is followed by the open-science community. 
Viewers can find the notebook through the 
wiki’s RSS feed and links on social media as 
well as in Google searches, and can post com- 
ments. He soon started to receive suggestions 
about, and valuable feedback on, his research 
and methods from other scientists — mostly 
followers of open science from outside his field 
— and some even led to collaborations. 

Four years on, Boettiger, now a theoretical 
ecologist at the University of California, Santa 
Cruz, is a leader in the use of the open note- 
book. He is convinced of its value, and he is not 
alone — the idea is steadily gaining traction in 
some (but not all) scientific circles. 

Whether on paper or in digital form, lab 
notebooks are meant to document exactly 
what, when and why experiments were done 
(see ‘Record of achievement’) and usually 


osm. 
uhage 


contain much more information than will 
ever be published in an academic paper. They 
can be used as evidence for securing patents, 
to settle legal issues or to pass a project from 
one researcher to another. Industry labs almost 
always require their researchers to maintain 
such records, as do many academic principal 
investigators, and until very recently the infor- 
mation would be kept securely under wraps in 
the lab until it was published. 

Open electronic notebooks are a radical 
departure from this ethos. Data and methods 
are no longer cloistered in books or tucked 
away on private hard drives. Instead, they can 
be shared online for all to see. Some scientists 
might shudder at the thought of anyone but 
lab mates and close collaborators knowing the 
detailed logic and steps behind their research 
projects before publication. But open > 
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> science is becoming more widely accepted 
as technologies change and as younger genera- 
tions of researchers discover alternative tools 
and approaches. 


EMBRACE THE OPENNESS 

Boettiger readily admits that having his entire 
scientific process exposed online at each step 
of the way carries a risk. But “you have to take 
risks to be successful’, he points out. “The 
idea that there's a risk-free way of getting your 
science out there and understood and engag- 
ing collaborators is a myth.” He posts updates 
to his work throughout the day, as if scrib- 
bling notes in a conventional notebook, but 
publishes synthesized analyses and summa- 
ries several times a week or month. Boettiger 
still has some concerns about being scooped 
(see Nature 493, 711; 2013), but says that those 
reservations are mostly offset by the many ben- 
efits of making his science open and accessible 
— such as providing another way to attract col- 
laborators and to gain recognition in his field. 

The gradual acceptance of open science in 
fields such as chemistry, mathematics, neu- 
roscience and ecology is highlighting the 
information-management challenges that 
scientists face. “Now that science has become 
more complicated and not all of the details 
and data fit in the papers we publish, we're at 
a loss,’ says Boettiger. “How do we communi- 
cate exactly what we're doing? How do we keep 
science replicable?” 

Scientists collect, store and analyse data in 
different ways and with a growing array of 
tools. That makes it difficult to compare one 
researcher's results with another’s, and even 
harder to know how to reproduce their results, 
says Carly Strasser, a data-curation and open- 
science specialist at the California Digital 
Library in Oakland. “We still have paper note- 
books, we still have Post-it notes, we still have 
phone calls that we don't transcribe and a ton of 
e-mails going back and forth,” she says. “There 
are just a lot of moving parts. Scientists don't 
ever really learn how to capture the process.” 


The problem is mainly down to a lack of 
training. Undergraduates turn in detailed 
notebooks for relatively simple experiments, 
but as the complexity of their work increases 
when they become graduate students, they 
may struggle to document these larger exper- 
iments. Many labs do not have formalized 
notebook-writing 
conventions in place. 
Strasser says that her 
approach as a PhD 
student in marine 
biology was makeshift 
— cutting and pasting 
computer printouts 
into a paper note- 
book, storing data in 
“chaotic heaps” on 
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tasks, they are turn- 

ing to electronic lab notebooks that have 
myriad bells and whistles and offer ‘workflow’ 
capabilities. These software packages aim to 
capture the entire scientific process — from 
study design to data analysis and visualization 
— in close to real time, and enable colleagues 
to review and replicate the work. Users can 
share data and methods as they develop, or 
choose to wait until publication. 

A piece of software called R, for example, 
is widely used in the ecology community. 
With it, researchers can weave together text, 
programming code and data analysis into a 
narrative. The steps leading to a published 
paper no longer need to be impenetrable to 
other researchers — anyone can access any 
step along the way and reproduce it, or take 
the work in other directions. “These tools 


RECORD OF ACHIEVEMENT 


allow me to explore alternative ideas in a 
structured and documented way without dis- 
rupting my existing work,’ says Karthik Ram, 
a computational ecologist at the University of 
California, Berkeley. 

Between 2009 and 2011, Ram was a post- 
doc at the University of California, Santa 
Cruz, studying the impacts of climate change 
on large mammals in Yellowstone National 
Park in Wyoming. He needed to incorporate 
decades of data from multiple sources — on 
the natural history of herbivores and on how 
migratory behaviour has changed over time, 
for example — and link it all to long-term 
climate data and changes in snowpack and 
vegetation. Because he uses existing data and 
models to answer questions, he often needs 
to understand intermediate steps along the 
way, such as the statistical methods used. Ram 
found it time-consuming and difficult to tease 
apart other researchers’ data sets so that he 
could build on them. Workflow platforms such 
as Rand [Python (see “Iaming the workflow’) 
are now making this type of work much more 
manageable, he says. Other platforms include 
Projects (developed by Digital Science, a sister 
company of Nature Publishing Group). 

Once a workflow system is populated with 
the nitty-gritty of the project, scientists can 
take advantage of Internet tools, such as wikis 
and social media, to post and share their work. 
Jean-Claude Bradley, an organic chemist at 
Drexel University in Philadelphia, Penn- 
sylvania, has several projects in his lab that 
rely on wikis. Any raw data a project obtains, 
including pictures, videos and spectroscopy 
results, are quickly shared among team mem- 
bers, usually within a day. Lab members also 
use the wikis to share their findings. One wiki 
page represents one page in a lab notebook, 
with sections, objectives, methods and other 
components. 

All these tools help to create what open- 
notebook aficionados call a living paper, 
which contains all components of the 
research, from e-mails to conference chatter 


A glossary of electronic notebooks 


There are many ways to build a twenty-first- 
century electronic notebook — and to make 
work accessible to others. 


@ Electronic notebook (open or closed) 

A digital version of the conventional 

paper lab notebook in which the entire 
scientific process can be captured. Unlike 
conventional notebooks, which are difficult 
for others to access, electronic notebooks 
make it easier for researchers to organize, 
manage and share the many components of 
their work. 


@ Workflow software Platforms such as R and 
|Python integrate all the pieces of research into 
a single system. They capture data collection 
and analyses in near real time and these can 
then be shared with colleagues for review or 
collaboration simply by clicking a button. 


@ Wiki Web tools that serve as platforms for 
electronic notebooks. Wikipedia, the online 
encyclopaedia, is one example of a wiki. A 
wiki can be made accessible and modifiable 
by others. Many researchers who practice 
open science use wikis to record their 
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progress and then share it with others. For 
example, a site called OpenWetWare allows 
researchers to build a notebook wiki. 


@ Living paper A term coined by researchers 
to describe the concept of archiving the 
entire scientific process, from phone calls 
and Twitter conversations to methods and 
analysis, in a way that is openly accessible. 

It allows other researchers to view the steps 
leading up to the final products and even 

to build on or derive ideas from the original 
work. A.M. 


CARLY STRASSER 


TAMING THE WORKFLOW 


The open-access platform IPython 


In 2001, as a final-year PhD student 

in theoretical physics, Fernando Perez 
started on a side project. For his 
research on quantum field theory, he 
needed to bring together his hotchpotch 
of computer code and data-analysis 
tools. Using a programming language 
called Python, he created IPython 
(ipython.org), an open-source, integrated 
platform that allowed him to type code, 
run his analyses, plot and visualize his 
data and add rich graphics within a 
single system. 

“When | started working on |Python 
| told myself and my adviser that this 
was just going to be an afternoon hack 
and that | would get back to ‘real work’ 
very soon,’ he says. Some 13 years later, 
Perez is a computational scientist at the 
Brain Imaging Center at the University 
of California, Berkeley, and he develops 
|Python for computational science, 
publication and education across domain 
disciplines as his full-time job. 

In 2011, he, along with collaborator 
Brian Granger at California Polytechnic 
State University in San Luis Obispo and 
their colleagues, added in a web-enabled 
notebook, which has been rapidly 
adopted by computational scientists 


on Twitter, and links them together in an 
openly accessible, easily updatable, digital 
workflow. “A living paper is alive in that 
it gets updated and it reflects the ongoing 
process,’ says Boettiger. It allows scientists 
to document their entire scientific process, 
which they — and whomever they choose 
to share it with — can add to and derive new 
ideas from, says Matthew Jones, an expert in 
informatics at the National Center for Eco- 
logical Analysis and Synthesis at the Univer- 
sity of California, Santa Barbara. 


PRESERVED FOR POSTERITY 
All these records must be stored, and archiv- 
ing not only data and computations, but also 
intellectual discussions, presents challenges. 
Boettiger links his Twitter feed to his open 
notebook, and generates a running online 
tab of his reading and notes through the 
Mendeley reference manager. “Our lit- 
erature has expanded so rapidly that other 
people are often our best sources of being 
able to figure out what to read,” he says. 
Peter Andras, a computer scientist at New- 
castle University, UK, also logs his reading 
through Mendeley and encourages his stu- 
dents to do the same. 

But web-based resources often decline in 


working in the fields of biology, physics 
and neuroscience. It functions like a 
word processor, with normal text and 
formatting, but also enables users to 
insert programming language and rich 
graphics and data analyses, and easily go 
back and forth between them. “It’s like 
having a very powerful calculator in the 
middle of your word processor that can 
do anything programming language can 
do,” says Perez. 

Researchers have even begun to 
publish papers directly from IPython, 
says Perez. The University of California, 
Berkeley, offers courses in IPython, and 
Harvard University and the Massachusetts 
Institute of Technology in Cambridge, and 
Columbia University in New York, among 
others, have adopted it. 

The National Center for Ecological 
Analysis and Synthesis at the University 
of California, Santa Barbara, runs 
workshops on collaborative synthesis and 
data-sharing for graduate students and 
postdocs. And short, on-site workshops 
by a worldwide volunteer group called 
Software Carpentry offer hands-on 
training in Python, R, GitHub and other 
data-synthesis programs at various 
locations. A.M. 


popularity and if that happens, the record 
can languish unseen, or potentially even 
be lost. FriendFeed, for example, a hybrid 
between Twitter and Facebook, had a healthy 
researcher following in 2010 but now has 
substantially fewer users. Phillip Lord, a bio- 
informatician anda collaborator of Andras, 
says that for security he archives digital files 
through both archive.org, a general web 
repository, and an initiative at the British 
Library in London. Archiving services act as 
networks of libraries, storing data in multiple 
places to ensure that if one copy were lost or 
destroyed, it could be retrieved elsewhere. 

Such challenges make it difficult to main- 
tain electronic and open notebooks — but 
they are unlikely to stop their increasing 
adoption. “We’re moving away from the 
science that we can document with a pen and 
paper to the science that we do in six different 
venues on instruments and with GPSs and 
laptops and paper and pen,’ says Strasser. 
“How do we capture that? It’s super hard” 
Even so, the advantages are hard to ignore. 
“The days of paper lab books,” Lord says, “are 
well past their best.” = 


Amanda Mascarelli is a freelance writer 
based in Denver, Colorado. 
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GRANTS 
Gender differences 


Women are less successful than men in 
getting grants from Research Councils 
UK (RCUK), according to an 11 March 
report from the umbrella body for the UK 
government’ research councils. RCUK’s 
first such analysis compared the annual 
success rate during 2010-13 of grant 
applications from women and men in 
eight age groups. Across all age and grant 
categories, women averaged a 25% success 
rate, compared with men’s 29%. Iain 
Cameron, head of research careers and 
diversity at RCUK, says that the reason for 
women’s lower success is not clear. One 
council is launching unconscious-bias 
training for peer reviewers, and RCUK 
may do a follow-up analysis. “The point of 
publishing these data is to expose them,” 
Cameron says. 


MOBILITY 
Swiss frozen out 


Early-career researchers in Switzerland 
will have limited access to the Erasmus 
Plus student-exchange programme and 
other European research funds for the 
next academic year. Switzerland lost its 
status as an Erasmus Plus programme 
country after it voted to impose quotas on 
immigrants from European Union (EU) 
countries that restrict the free movement 
of people from Croatia, the newest EU 
member. The change means that Swiss 
nationals who wish to study in the EU will 
no longer be eligible for Erasmus grants 
and that students from eligible countries 
will not be able to obtain an Erasmus 
grant to study in Switzerland. Erasmus 
funds around 7,000 exchanges a year 
between Switzerland and other countries. 


SALARIES 


Public boost 


Pay rises were slightly higher for faculty 
members at US public institutions (2.2%) 
than at private institutions (2.0%) for the 
first time since 2009, finds the College 
and University Professional Association 
for Human Resources (CUPA-HR) 

in Knoxville, Tennessee. The change 
suggests an overall strengthening of 

state economies, says Andy Brantley, 
CUPA-HR president. CUPA-HR’s poll 

of 792 institutions and 180,000 faculty 
members also showed that compensation 
for tenure-track and tenured faculty 
members across both university categories 
rose by an average of 2.1% this academic 
year, the same increase as the year before. 
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BY GEORGE ZEBROWSKI 


s usual, I would have to leave 
Ate I arrived. Memory 
threw me there, exiling me 
from all my other times with her, no 
matter how much I concentrated. 
She was alive in all our pasts, but only 
these earlier presents were open to me. 
“Me again?” she asked in our pre- 
sent, jealous and prideful. 

“No one else,” I said. 

I kissed her before I bridged, fight- 
ing off Maxim Gorky’s claim that 
“love is the failure of mind to under- 
stand nature” If so, then love was an 
opposed way, an uphill fight at best. 

She was asleep back there in our 
off-campus apartment, as I came up 
by train from New York City. I would 
have just enough time to get there and 
spend some time with her before the 
train arrived. 

I always prepared by losing a pound 
or two, colouring my hair a bit and 
exercising, even using some make-up 
to look younger than my late 60s, so 
that she would not notice in the dim 
light of the apartment at night. Near- 
sighted and in bed, it helped that she 
would not be wearing her glasses. 

I grasped my key from decades 
past, and summoned the vision of 
the pale-skinned young woman who had 
dyed her hair black after a silly blonde 
experiment, and then cut it short when I 
was away. I would again compliment the 
change. 

My appearance at my old door shook the 
back porch for a moment. I stood before the 
curtained glass, but no light went on. I was 
fearful that this might be my last chance to 
regain this time, so I had to make it count. 
Other times with her might open to me if 
this one closed, but that was far from certain. 

The theory of jumps was not perfect. 
There might not even be any real time dis- 
placement at all, but instead a reality-like 
recreation of significant memories that 
suddenly occupied a mind with a quantum 
flood of insistence to the point at which it 
made no difference to the experience; it 
might just as well be happening in the naive 

sense. Time probably 
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Time well spent. 


stubborn duration, inexpressible endurance 
beyond time-like words. 

I turned my key in the lock, pushed the 
door open and went in, closing it behind me. 

“Who's there?” asked her voice, from 
somewhere inside me, it seemed. 

“Tt’s me,” I said, hoping to sound younger. 

“Oh,” she said uncertainly. 

Icrossed the small living room to the dou- 
ble bed in the alcove and sat down. Her head 
came out from beneath the covers, hair cut 
short and dyed black. She looked up at me 
like a queen on a divan. 

“Beautiful!” I said, and she giggled as I lay 
down beside her. 

“You must be tired,” she said as I sighed. 
“It's okay, we can just sleep,” she whispered. 
“We have all tomorrow.” 

She dozed. I lay there, afraid to disappoint 
her. 

After a while I looked at my watch and 
saw that I would soon arrive, and it would 
not matter. Were anachronisms real or only 
apparent? You can have all the anachronisms 
you want in your mind, where they happen 
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all the time. Was I asleep up ahead? I 
felt a rocking sense of loss as I heard 
the train whistle. 

“What is it?” she asked softly as 
my moments with her fled into some 
deep abyss where I could not follow. 

“Tl be right back,” I said and got up. 
She turned over and closed her eyes, 
expecting that I was only undressing. 
I stood there, looking at the glory of 
her bare back. To kiss her now might 
be fatal if I arrived early. 

I went out the door and breathed 
the night air, knowing that I was com- 
ing down the street, and that I could 
leave it all to him. 

The starry night was blue. I went 
across the yard and stood by the brick 
wall of the garage. I would come 
through the narrow alleyway from the 
street beyond, less than a block from 
the train station. 

My memory moved within him. I 
was waiting here only to see him walk 
by in the dark. The unreality of time 
seized me, and seemed about to inflict 
a wretched pain, but relented. 

Time could show mercy if you 
remembered well enough, a collec- 
tive delusion, the setting of human 
psychology, biologically based, to 
conceal the fact that everything was 
in one place and happened all at once. 

I would return, inexactly, to this very time 
and place as often as my force of memory 
struck out against loss. I looked around the 
dark yard. I was present here in these shad- 
ows more than once, similarly aware, only 
moments apart. 

He came through the dark alleyway, and I 
felt the flow of love for his waiting beloved; 
love then, not my love now. His step was 
sure, the past his own, his youth holding 
back the incoming future. The same key I 
still held was in his pocket. 

He would not disappoint the dark-haired 
goddess in her bed. 

I was gone before I got to the door. m 
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